<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-9-215</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Complete Sequence and Analysis of the Mitochondrial Genome of <it>Hemiselmis andersenii </it>CCMP644 (Cryptophyceae)</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Kim</snm>
               <fnm>Eunsoo</fnm>
               <insr iid="I1"/>
               <email>eunsookim@dal.ca</email>
            </au>
            <au id="A2">
               <snm>Lane</snm>
               <mi>E</mi>
               <fnm>Christopher</fnm>
               <insr iid="I1"/>
               <email>C.Lane@dal.ca</email>
            </au>
            <au id="A3">
               <snm>Curtis</snm>
               <mi>A</mi>
               <fnm>Bruce</fnm>
               <insr iid="I2"/>
               <email>Bbcurtis@genomeatlantic.ca</email>
            </au>
            <au id="A4">
               <snm>Kozera</snm>
               <fnm>Catherine</fnm>
               <insr iid="I2"/>
               <email>Catherine.Kozera@nrc-cnrc.gc.ca</email>
            </au>
            <au id="A5">
               <snm>Bowman</snm>
               <fnm>Sharen</fnm>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>Sharen.Bowman@nrc-cnrc.gc.ca</email>
            </au>
            <au id="A6" ca="yes">
               <snm>Archibald</snm>
               <mi>M</mi>
               <fnm>John</fnm>
               <insr iid="I1"/>
               <email>jmarchib@dal.ca</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Canadian Institute for Advanced Research, Integrated Microbial Biodiversity Program, Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada</p>
            </ins>
            <ins id="I2">
               <p>The Atlantic Genome Centre, Halifax, Nova Scotia, Canada</p>
            </ins>
            <ins id="I3">
               <p>Department of Process Engineering and Applied Science, Dalhousie University, Halifax, Nova Scotia, Canada</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>1</issue>
         <fpage>215</fpage>
         <url>http://www.biomedcentral.com/1471-2164/9/215</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18474103</pubid>
               <pubid idtype="doi">10.1186/1471-2164-9-215</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>19</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>12</day>
               <month>5</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>12</day>
               <month>5</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Kim et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Cryptophytes are an enigmatic group of unicellular eukaryotes with plastids derived by secondary (i.e., eukaryote-eukaryote) endosymbiosis. Cryptophytes are unusual in that they possess four genomes&#8211;a host cell-derived nuclear and mitochondrial genome and an endosymbiont-derived plastid and 'nucleomorph' genome. The evolutionary origins of the host and endosymbiont components of cryptophyte algae are at present poorly understood. Thus far, a single complete mitochondrial genome sequence has been determined for the cryptophyte <it>Rhodomonas salina</it>. Here, the second complete mitochondrial genome of the cryptophyte alga <it>Hemiselmis andersenii </it>CCMP644 is presented.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>The <it>H. andersenii </it>mtDNA is 60,553 bp in size and encodes 30 structural RNAs and 36 protein-coding genes, all located on the same strand. A prominent feature of the genome is the presence of a ~20 Kbp long intergenic region comprised of numerous tandem and dispersed repeat units of between 22&#8211;336 bp. Adjacent to these repeats are 27 copies of palindromic sequences predicted to form stable DNA stem-loop structures. One such stem-loop is located near a GC-rich and GC-poor region and may have a regulatory function in replication or transcription. The <it>H. andersenii </it>mtDNA shares a number of features in common with the genome of the cryptophyte <it>Rhodomonas salina</it>, including general architecture, gene content, and the presence of a large repeat region. However, the <it>H. andersenii </it>mtDNA is devoid of inverted repeats and introns, which are present in <it>R. salina</it>. Comparative analyses of the suite of tRNAs encoded in the two genomes reveal that the <it>H. andersenii </it>mtDNA has lost or converted its original <it>trnK(uuu) </it>gene and possesses a <it>trnS</it>-derived '<it>trnK(uuu)</it>', which appears unable to produce a functional tRNA. Mitochondrial protein coding gene phylogenies strongly support a variety of previously established eukaryotic groups, but fail to resolve the relationships among higher-order eukaryotic lineages.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Comparison of the <it>H. andersenii </it>and <it>R. salina </it>mitochondrial genomes reveals a number of cryptophyte-specific genomic features, most notably the presence of a large repeat-rich intergenic region. However, unlike <it>R. salina</it>, the <it>H. andersenii </it>mtDNA does not possess introns and lacks a Lys-tRNA, which is presumably imported from the cytosol.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The mitochondrion is a double-membrane enclosed organelle found in the vast majority of extant eukaryotes. Mitochondria are best known for their essential role in energy generation, but they are also the site of additional important cellular processes such as iron-sulfur (Fe-S) cluster assembly and the beta-oxidation of fatty acids <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Some degenerate forms of mitochondria, such as the mitosome of the diplomonad parasite <it>Giardia lamblia</it>, have secondarily lost energy generating pathways and seem to retain only the Fe-S cluster maturation function <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. All mitochondria are believed to share a single origin from an &#945;-proteobacterial-like prokaryote <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, but a wide diversity of mitochondrial genome architectures have evolved subsequent to the diversification of modern-day eukaryotes <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. For example, whereas "derived" animals possess monomeric circular mitochondrial genomes, an observation which led to the initial assumption that mtDNAs are primarily circular <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, many other mitochondrial genomes, such as that of the ciliate <it>Tetrahymena pyriformis </it><abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, the green alga <it>Chlamydomonas reinhardtii </it><abbrgrp><abbr bid="B7">7</abbr></abbrgrp> and the cnidarian metazoan <it>Aurelia aurita </it>(moon jelly) <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> are linear <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. In addition, while some fungi and many plants have circular-mapping mtDNAs, their mitochondria actually contain predominantly linear mtDNA molecules with combinations of monomers and concatemers, with only a minor fraction of the molecules present in a circular form <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. A more extreme example is the mtDNA of kinetoplastids, which consists of one maxi- and many different mini-circles that are interconnected to form an extensive network <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Mitochondrial gene content is also highly variable; the mtDNA of the jakobid flagellate <it>Reclinomonas americana </it>encodes 97 genes, the largest set of mitochondrial genes currently known <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, whereas the mtDNA of the malaria parasite <it>Plasmodium falciparum </it>contains just 3 protein coding genes and 2 highly fragmented small and large subunit ribosomal RNA (rRNA) genes <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. The most highly derived forms of mitochondria, such as the hydrogenosome of <it>Trichomonas vaginalis </it><abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and the <it>Giardia lamblia </it>mitosome <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, have lost their genomes entirely <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
         <p>Mitochondria are also known as sites of unusual molecular biology and biochemistry. Marande and Burger <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> recently showed that the mtDNA genes of the euglenid <it>Diplonema papillatum </it>are fragmented into as many as nine modules, each residing on a distinct 6 or 7 Kbp chromosome. The mechanism by which these fragmented gene pieces are linked together to form contiguous transcripts is unknown. Extensive mRNA editing is another example of the bizarre molecular biology of mitochondria. Kinetoplastid mitochondrial mRNAs are subject to insertions and deletions of uridylate residues, sometimes >100 such insertions/deletions per transcript <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Mitochondrial mRNA editing is also widespread in land plants <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> and dinoflagellates <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. For example, ~2% of the <it>cox1 </it>and <it>cob </it>gene sequences in three dinoflagellate species investigated by Lin et al. <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> were edited at the mRNA level.</p>
         <p>We are studying the genomic diversity and evolution of cryptophytes, a ubiquitous and ecologically significant group of single-celled eukaryotes found in freshwater and marine environments. Most cryptophytes, except for members of the genus <it>Goniomonas</it>, harbor plastids of secondary endosymbiotic origin <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. A variety of shared morphological features, such as the presence of ejectisomes, flat mitochondrial cristae, and an anterior depression, support the monophyly of cryptophytes, as do molecular phylogenetic data <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. One unique feature of cryptophyte plastids that distinguishes them from other plastids of red algal origin is the retention of the remnant nucleus of the red algal endosymbiont, referred to as the nucleomorph <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>. Consequently, most cryptophytes harbor four distinct genomes&#8211;nuclear, nucleomorph, mitochondrial, and plastid genomes&#8211;contained in separate compartments. Cryptophytes are thus an interesting model system with which to study endosymbiotic gene transfer, genome evolution, and protein targeting.</p>
         <p>In this study, we report the complete mitochondrial genome sequence of the newly described cryptophyte species <it>Hemiselmis andersenii </it>CCMP644, and compare it to the only other cryptophyte mitochondrial genome described thus far, that of <it>Rhodomonas salina </it><abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. In addition, individual and concatenated mitochondrial protein coding gene sequences were analyzed to infer the phylogenetic relationships of cryptophytes to other eukaryotes.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>DNA preparation, sequencing, and genome assembly</p>
            </st>
            <p><it>Hemiselmis andersenii </it>mtDNA was isolated and sequenced to ~10&#215; coverage as described in Lane et al. <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. About 1,200 end sequences were screened for quality and vector contamination with Pregap4 and automatically assembled using gap4 version 4.10 in the Staden package <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Complete automated assembly of a large intergenic space between <it>trnS </it>and <it>cox2 </it>was unsuccessful due to the highly repetitive nature of this region. In an attempt to manually resolve this area, short (~30 bp) unique sequences within the <it>trnS </it>and <it>cox2 </it>genes were used to probe the sequence database for reads that extended from these two loci into the repeat region. These sequences were extracted and manually aligned using MacClade version 4.08 <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. Sequences at the ends of the new constructs were then selected and the process was repeated. However, due to the presence of multiple identical copies of a >500 bp repeat, the assembly of a single unambiguous contig was not possible. When all available sequence reads were considered, three robust contigs were produced, each ending with similar repetitive sequences consisting of a ~340 bp repeat unit. These three contigs were joined to circularize the map. The complete <it>H. andersenii </it>mtDNA has been submitted to GenBank under the following accession number: <ext-link ext-link-type="gen" ext-link-id="EU651892">EU651892</ext-link>.</p>
            <p>DNA secondary structure within the repeat region was predicted using mfold version 3.2 <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> at a folding temperature of 37&#176;C and the ionic conditions of 1.0 M [Na<sup>+</sup>] and 0.0 M [Mg<sup>++</sup>].</p>
         </sec>
         <sec>
            <st>
               <p>Genome size/structure determination</p>
            </st>
            <p>We used pulsed-field gel electrophoresis (PFGE) to obtain an independent size estimate of the <it>H. andersenii </it>mitochondrial genome. <it>Hemiselmis andersenii </it>total DNA plugs were prepared as described in Lane <it>et al</it>. <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> and digested overnight with the restriction enzymes <it>Pst</it>I or <it>Bgl</it>II (Fermentas, Hanover, MD, USA). Based on the genome sequence, these enzymes were predicted to cut the mtDNA only once or twice. Both untreated and enzyme-digested <it>H. andersenii </it>DNA plugs were run on a 1% agarose gel (1&#215; TBE) in 0.5&#215; TBE buffer at 14.0&#176;C for 18 h at a voltage of 6.0 V/cm with a switch time between 1&#8211;25 s using a CHEF-DR III Pulsed-Field Gel Electrophoresis System (Bio-Rad Laboratories, Hercules, CA, USA). DNA on the pulsed-field gel was transferred to a nylon membrane. Southern hybridization using a ~700 bp <it>cox</it>I probe as in Lane and Archibald <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> revealed that undigested mitochondrial DNA molecules were trapped in the wells or found in the 'compression zone'. The <it>Pst</it>I or <it>Bgl</it>II endonuclease treated DNA plugs revealed mitochondrial molecules in a discrete band below the 'compression zone'. The corresponding bands could not be visualized on the ethidium-bromide stained pulsed-field gel image because of nuclear and nucleomorph DNA smears in the background. In order to visualize mtDNA on the pulsed-field gel, an initial PFGE run was used to remove the linear nuclear and nucleomorph chromosomes from the PFGE plugs. These plugs, which still contained organellar DNA, were subsequently removed from the gel and digested with the restriction enzymes <it>Pst</it>I and <it>Bgl</it>II. Digested plugs were then inserted into a fresh gel and electrophoresed under the conditions described above. The 5 Kbp and Lambda CHEF DNA Size Standard (Bio-Rad Laboratories, Hercules, CA, USA) were used to estimate the size of the enzymatically linearized <it>H. andersenii </it>mtDNA.</p>
         </sec>
         <sec>
            <st>
               <p>Genome annotation and GC content/skew analyses</p>
            </st>
            <p>Annotation of the <it>H. andersenii </it>mtDNA and the GC content and skew analyses were performed in Artemis version 8 <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. Gene identification was carried out using BLASTX and BLASTN. Small and large ribosomal rRNA subunit genes were identified by comparison to rRNA gene sequences in the mitochondrial genome of <it>Rhodomonas salina</it>. Transfer RNAs were identified using tRNAscan-SE version 1.21 <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Genome rearrangements between the two cryptophyte mtDNA</p>
            </st>
            <p>The extent to which the <it>H. andersenii </it>and <it>R. salina </it>mitochondrial genomes are rearranged to each other was estimated using GRIMM <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Each genome was designated as a sequence of 63 units, which include a repeat region and 62 genes common between the two cryptophyte mtDNAs.</p>
         </sec>
         <sec>
            <st>
               <p>RT-PCR of '<it>trnK(uuu)</it>'</p>
            </st>
            <p>tRNAscan-SE version 1.21 <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> identified a putative intron of ~20 bp in the anticodon loop of the <it>H. andersenii trnK(uuu) </it>gene. To determine whether this prediction was correct, we performed RT-PCR using Lysine-tRNA-specific primer pairs and <it>H. andersenii </it>total RNA provided by H. Khan. To eliminate DNA contamination, 1 &#956;l of total RNA was incubated for 30 min with RQ1 RNase-Free Dnase (Promega, Madison, WI, USA). RT-PCR was performed using the QIAGEN one-step RT-PCR kit (QIAGEN, Valencia, CA, USA) and with control reactions in which the reverse-transcription process was skipped. The following two pairs of primers were used: 1) The forward primer 5'-GAAGGTTGCTCGAATGGAA-3' with the reverse primer 5'-GAAGGTATAGGAATTGAACCTATTC-3' 2) and the forward primer 5'-GCCCAGAAGGTTGCTC-3' with the reverse primer 5'-AAGAAGGTATAGGAATTGAACCTAT-3'. RT-PCR was performed with the reverse transcription step for 30 min at 50&#176;C and the subsequent inactivation of reverse transcriptase and activation of HotStart Taq DNA polymerase for 15 min at 95&#176;C, followed by 35 cycles at 94&#176;C for 1 min, 47&#176;C for 1 min, and 72&#176;C for 1 min, and a final extension at 72&#176;C for 10 min. The amplified PCR fragments were cloned into pCR4-TOPO vector in the TOPO TA cloning kit for sequencing (Invitrogen, Carlsbad, CA, USA). Between 5 and 10 bacterial colonies from each reaction were selected for sequencing on a Beckman Coulter CEQ8000 (Beckman Coulter Inc., Fullerton, California, USA).</p>
         </sec>
         <sec>
            <st>
               <p>Molecular phylogenetic analysis</p>
            </st>
            <p>From the 36 protein-coding genes found in the <it>H. andersenii </it>mtDNA, 25 were selected for phylogenetic analyses. Eleven genes (<it>atp8, nad8, rps2, rps3, rps4, rps7, rps8, rps13, rpl5, rpl6, tatC</it>) were excluded because their sequences were poorly conserved and/or were only present in a few taxonomic groups. <it>H. andersenii </it>protein sequences were aligned with their homologs from other mitochondrial genomes available from GenBank. Amino acid sequences were aligned using MacClade version 4.08 <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> and ambiguously aligned sites were manually removed. In addition to individual protein analyses, a concatenated protein data set containing 25 proteins was analyzed. To include the maximum number of gene sequences, we combined 25 protein-coding gene sequences encoded in 18 mitochondrial genomes across diverse eukaryotic taxa. As most mitochondrial genomes do not possess all 25 protein-coding genes selected for analysis, as many as 12 protein gene sequences were missing per taxon. A maximum likelihood tree was produced using RAxML-VI-HPC version 2.2.3 <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> with the PROTOMIXJTT model of sequence evolution and the automatic tree rearrangement setting, and from 100 distinct randomized maximum parsimony starting trees. Bootstrap analysis was based on 100 re-samplings.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results and Discussion</p>
         </st>
         <sec>
            <st>
               <p>General features of <it>Hemiselmis andersenii </it>mtDNA</p>
            </st>
            <p>The mitochondrial DNA of the cryptophyte <it>Hemiselmis andersenii </it>CCMP644 was sequenced, assembled and manually edited to produce a circular-mapping genome 60,553 bp in size (Figure <figr fid="F1">1</figr>). Genome assembly was complicated by the presence of a highly repetitive non-coding region of ~20 Kbp (see below); genome size was thus verified using pulsed-field gel electrophoresis (PFGE). Several observations suggest that the <it>H. andersenii </it>mtDNA exists primarily in a linear-branched form comprised of multiple genome units. In PFGE, the <it>H. andersenii </it>mtDNA remains in the well or migrates within the 'compression zone' (i.e., the unresolved portion of DNA near the top of the gel), which contains primarily linear nuclear and nucleomorph chromosomes larger than ~150 Kbp (data not shown). The lack of mtDNA below the 'compression zone' suggests that the <it>H. andersenii </it>mtDNA is not composed of linear monomers or dimers. Furthermore, when the <it>H. andersenii </it>mtDNA is partially digested with <it>Pst</it>I, an enzyme predicted to cut the genome only once, it produces a discrete band of ~60 Kbp in size (data not shown) but not a band ~120 Kbp in size, which would correspond to a dimeric linear form of the genome. This result indicates that the <it>H. andersenii </it>mtDNA is not composed of circular concatemers or linear head-to-tail concatemers consisting of three or more genomic units. Therefore, we suggest that the <it>H. andersenii </it>mtDNA exists primarily as a branched linear molecule although monomeric circles may also exist. Further studies using transmission electron microscopy or the 'moving picture' technique <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> will be necessary to confirm this hypothesis.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Circular map of the mitochondrial genome of the cryptophyte <it>Hemiselmis andersenii</it></p>
               </caption>
               <text>
                  <p><b>Circular map of the mitochondrial genome of the cryptophyte <it>Hemiselmis andersenii</it></b>. All of the genes are transcribed in a clockwise direction. Note the dense gene arrangement and a single large intergenic region. Protein-coding genes and ribosomal RNA genes are labeled outside the circle, whereas transfer RNAs and open reading frames of unknown functions are labeled on the inside. '<it>TrnK(uuu)</it>' may be a pseudogene (see main text for discussion). Genes are color-coded according to functional categories: green for ribosomal protein genes, gray for genes involved in oxidative phosphorylation, pink for the protein translocase protein gene <it>tatC</it>, salmon for ribosomal subunit genes, and black for open reading frames with unknown functions.</p>
               </text>
               <graphic file="1471-2164-9-215-1"/>
            </fig>
            <p>The <it>H. andersenii </it>mitochondrial genome is comprised of a gene-rich region ~40 Kbp in size and a large (19,675 bp) intergenic region between <it>trnS </it>and <it>cox2 </it>with complex repeats (Figures <figr fid="F1">1</figr> and <figr fid="F2">2</figr>). The intergenic region accounts for 32.5% of the entire genome and 83.5% of the total amount of non-coding DNA (23,549 bp). The overall GC content of the genome is 28.72%, slightly higher than that of the nucleomorph genome of this organism <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Interestingly, a ~40 bp region near the start of the coding portion of the genome is very GC-rich (78.38%) and is followed by a 100% AT-containing region ~190 bp in size (Figure <figr fid="F3">3</figr>). This unusual stretch of sequence is about 70 bp from a palindromic sequence that is predicted to form a Type II stem-loop (Figures <figr fid="F2">2</figr> and <figr fid="F3">3</figr>; see discussion below), and may be involved in regulating replication or transcription.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Schematic diagram of the large repeat region in the <it>Hemiselmis andersenii </it>mitochondrial genome</p>
               </caption>
               <text>
                  <p><b>Schematic diagram of the large repeat region in the <it>Hemiselmis andersenii </it>mitochondrial genome</b>. This region is ~20 Kb in size and includes multiple repeat units arranged in tandem or dispersed among tandem repeats. Slight variations of each repeat unit are color-coded and/or marked with strips or a star symbol. Predicted nucleotide deletions within a repeat unit are highlighted with arrowheads: the size of the deletion is also provided. The positions of three kinds of DNA stem-loop forming sequences, Type I-a, I-b, and II, are labeled with "hairpin" symbols.</p>
               </text>
               <graphic file="1471-2164-9-215-2"/>
            </fig>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>(A) Isolated region of the <it>Hemiselmis andersenii </it>mitochondrial genome located near the 3' end of the large intergenic space</p>
               </caption>
               <text>
                  <p><b>(A) Isolated region of the <it>Hemiselmis andersenii </it>mitochondrial genome located near the 3' end of the large intergenic space.</b> This area includes a palindromic sequence that is predicted to form a Type II stem-loop followed by high and low GC regions. A similar region is not found in the <it>R. salina </it>mtDNA. <b>(B) Three predicted stem-loop structures found within the large intergenic space.</b> Note that Type I-a and I-b differ only by 3 nucleotides.</p>
               </text>
               <graphic file="1471-2164-9-215-3"/>
            </fig>
            <p>The <it>H. andersenii </it>mitochondrial genome encodes 66 genes with predicted functions and 8 hypothetical protein-coding genes, a total somewhat higher than the average for eukaryotes (40&#8211;50 genes) <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Ten genes&#8211;<it>orf167, orf71, rps13, rps11, nad3, rps2, tatC, 'trnK (uuu)', rps12</it>, and <it>rps7</it>&#8211;overlap by up to 51 bp, emphasizing the extreme compactness of the coding portion of the genome. The genome encodes small and large rRNA subunit genes and 28 tRNAs, one of which may be a pseudogene (see discussion below). Of the 36 identifiable protein-coding genes, 14 encode ribosomal proteins, 21 are involved in oxidative phosphorylation, and one gene encodes a membrane translocase protein (Table <tblr tid="T1">1</tblr>).</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Functional categories of 36 protein genes encoded in the mitochondrial genome of <it>Hemiselmis andersenii</it>.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>Protein categories</p>
                     </c>
                     <c ca="left">
                        <p>Sub-categories</p>
                     </c>
                     <c ca="left">
                        <p>Genes</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ribosomal proteins (14)</p>
                     </c>
                     <c ca="left">
                        <p>Small subunit</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>rps2 rps3 rps4 rps7 rps8 rps11 rps12 rps13 rps14 rps19</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Large subunit</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>rpl5 rpl6 rpl14 rpl16</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Oxidative phosphorylation (21)</p>
                     </c>
                     <c ca="left">
                        <p>NADH dehydrogenase</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>nad1 nad2 nad3 nad4 nad4L nad5 nad6 nad7 nad8 nad9 nad10 nad11</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Ubiquinol:cytochrome c oxidoreductase</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>cob</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Succinate:ubiquinone oxidoreductase</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>sdh3</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Cytochrome c oxidase</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>cox1 cox2 cox3</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>ATP synthase</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>atp1 atp6 atp8 atp9</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Sec-independent protein translocase protein (1)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>tatC</it>
                        </p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>Comparison of the mtDNA gene order in <it>H. andersenii </it>to other genomes reveals the presence of five gene clusters shared among distantly related protists: two ribosomal protein clusters (<it>rps12-rps7-rps19-rps3-rpl16-rpl14-rpl5-rps14 </it>and <it>rps8-rpl6-rps13-rps11</it>) and three NADH dehydrogenase clusters (<it>nad4L-nad5</it>; <it>nad4-nad2</it>; <it>nad10-nad9</it>). These gene clusters have been suggested to represent vestiges of bacterial operons <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B24">24</abbr></abbrgrp>. Interestingly, all 74 genes in the <it>H. andersenii </it>mitochondrial genome are encoded on the same strand. While the evolution of such an arrangement seems improbable, absolute strand polarity has been observed in the mitochondrial genomes of diverse eukaryotes such as the amoeba <it>Acanthamoeba castellanii </it>(59 genes), the fungus <it>Penicillium marneffei </it>(47 genes), and the green alga <it>Chlamydomonas eugametos </it>(20 genes) <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>. In addition, strikingly similar mtDNA architectures&#8211;gene-dense regions, a single large repetitive intergenic region, and all genes encoded on one strand&#8211;are seen in diverse protists such as the stramenopile <it>Thraustochytrium aureum </it>(The Organelle Genome Megasequencing Program; http://megasun.bch.umontreal.ca/ogmp/) and the green alga <it>Pedinomonas minor </it><abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. Understanding the biological significance of such convergence at the level of genome architecture will require comparative molecular and biochemical studies of mitochondria in these organisms.</p>
         </sec>
         <sec>
            <st>
               <p>Comparison of the mitochondrial genomes of <it>Hemiselmis andersenii </it>and <it>Rhodomonas salina</it></p>
            </st>
            <p><it>H. andersenii </it>is only the second cryptophyte, after <it>R. salina </it><abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, for which a mitochondrial genome has been completely sequenced and annotated. Comparative analyses of the two genomes revealed a number of similarities. Both genomes feature a compact gene arrangement and a single large repeat region (Figure <figr fid="F1">1</figr>) <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, although the size of the large intergenic region in <it>H. andersenii </it>(~20 Kbp) is more than four times as large as that of <it>R. salina </it>(~4.7 Kbp). All of the 36 predicted protein-coding genes in the <it>H. andersenii </it>mitochondrial genome are present in the <it>R. salina </it>mtDNA. Four <it>R. salina </it>mitochondrion-encoded genes&#8211;<it>rps1, atp4, tatA</it>, and <it>sdh4</it>&#8211;are not found in <it>H. andersenii</it>, although two open reading frames, <it>orf45 </it>and <it>orf91</it>, in the <it>H. andersenii </it>mtDNA show marginal sequence similarity to the <it>R. salina tatA </it>and <it>sdh4 </it>genes, respectively. Additionally, while two group II introns are present in <it>R. salina </it>mtDNA, the <it>H. andersenii </it>mtDNA is devoid of introns (Table <tblr tid="T2">2</tblr>) <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Comparison of two cryptophyte mitochondrial genomes</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Hemiselmis andersenii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Rhodomonas salina</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Genome size</p>
                     </c>
                     <c ca="left">
                        <p>60, 553 bp</p>
                     </c>
                     <c ca="left">
                        <p>48,063 bp</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Coding capacity</p>
                     </c>
                     <c ca="left">
                        <p>61%</p>
                     </c>
                     <c ca="left">
                        <p>69%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GC %</p>
                     </c>
                     <c ca="left">
                        <p>29%</p>
                     </c>
                     <c ca="left">
                        <p>29%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Size of the repeat region</p>
                     </c>
                     <c ca="left">
                        <p>19.7 Kbp (33%)</p>
                     </c>
                     <c ca="left">
                        <p>4.7 Kbp (10%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Group II introns</p>
                     </c>
                     <c ca="left">
                        <p>not present</p>
                     </c>
                     <c ca="left">
                        <p>two</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of genes (with assignable functions)</p>
                     </c>
                     <c ca="left">
                        <p>66 genes (28 tRNAs)</p>
                     </c>
                     <c ca="left">
                        <p>69 genes (27 tRNAs)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Inverted repeats</p>
                     </c>
                     <c ca="left">
                        <p>not present</p>
                     </c>
                     <c ca="left">
                        <p>a pair of ~1.5 Kbp repeat units</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>With respect to conservation of gene order, 64.5% of the shared genes between the two cryptophyte mitochondrial genomes (40 out of 62 genes&#8211;36 protein-coding genes, 24 tRNA genes (see below), 2 rRNA genes) are present in thirteen syntenic blocks, each consisting of 2&#8211;7 genes. These include: 1) <it>cox1-cob-nad11</it>, 2) <it>nad4L-nad5</it>, 3) <it>atp1-trnP(ugg)</it>, 4) <it>rps8-rpl6-rps13-rps11</it>, 5) <it>trnC(gca)-atp6</it>, 6) <it>trnI(gau)-trnQ(uug)-trnR(gcg)-trnE(uuc)-trnW(cca)-nad10-nad9</it>, 7) <it>nad4-nad2</it>, 8) <it>trnR(ucu)-trnG(ucc)</it>, 9) <it>trnM(cau)f-trnS(uga)</it>, 10) <it>trnY(gua)-trnL(uag)</it>, 11) <it>tatC-'trnK(uuu)' </it>[<it>H. andersenii</it>] <it>/trnS(gcu) </it>[<it>R. salina</it>]-<it>nad7</it>, 12)<it>cox3-rps12-rps7-rps19</it>, and 13) <it>rps3-rpl16-rpl14-rpl5-rps14</it>. As noted earlier, some of the conserved gene clusters, such as <it>nad4L-nad5</it>, are found in distantly related eukaryotes and appear to be vestiges of bacterial operons. Analysis using GRIMM <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> suggests that the observed difference in gene order between the two cryptophyte mitochondrial genomes can be explained by at least 31 instances of genome reversal events.</p>
         </sec>
         <sec>
            <st>
               <p>Repeat structure of the <it>H. andersenii </it>mitochondrial genome</p>
            </st>
            <p>The <it>R. salina </it>mtDNA is characterized by a pair of ~1.5 Kbp inverted repeats that are joined by 112 bp of sequence <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. In contrast, repeats in the <it>H. andersenii </it>mitochondrial genome are not inverted, but are instead dispersed or arranged in tandem throughout the large non-coding region, with individual repeat units ranging from 22 to 336 bp and occurring up to 100 times (Figure <figr fid="F2">2</figr>). Given that <it>R. salina </it>and <it>H. andersenii </it>are distantly related to one another <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, the large repeat region presumably arose during or prior to the early diversification of cryptophytes. While there is no obvious sequence similarity between the two repeat regions, both contain multiple copies of palindromic sequences, which are predicted to form stable stem-loop DNA structures <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. In <it>H. andersenii</it>, two types of stem-loop structures were identified&#8211;I and II&#8211;using the DNA MFOLD program <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. The Type I structure has two slight variations, I-a and I-b, which occur 21 and 5 times, respectively (Figures <figr fid="F2">2</figr> and <figr fid="F3">3</figr>). Type I-a and I-b structures have 22 and 20 base pairings in their stems, respectively, and occur adjacent to tandem repeats (Figures <figr fid="F2">2</figr> and <figr fid="F3">3</figr>). One copy of the type II stem-loop structure is located within a ~300 bp segment that is devoid of any discernable repeat units, but close to the high and low GC regions noted earlier (Figures <figr fid="F2">2</figr> and <figr fid="F3">3</figr>). As was suggested for <it>R. salina </it>by Hauth et al. <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, tandem repeats and multiple stem-loop structures in <it>H. andersenii </it>mtDNA might be involved in the regulation of transcription and replication, a hypothesis that needs to be tested further.</p>
            <p>Hauth et al. <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> demonstrated that the repeat region of the <it>R. salina </it>mtDNA roughly coincides with a change in the direction of 'cumulative GC skew' [calculated as (G-C)/(G+C)] and suggested that the repeat corresponds to the origin of replication. We investigated the GC skew in the <it>H. andersenii </it>mitochondrial genome to see whether a similar pattern exists. Unlike <it>R. salina</it>, however, the <it>H. andersenii </it>GC skew does not change direction near the repeat region. Instead, in both the <it>H. andersenii </it>and <it>R. salina </it>mtDNA, observed GC skew patterns strongly correlate with transcriptional orientations, where the coding strand tends to be G-rich (data not shown). Therefore, the GC skew patterns of the two cryptophyte mitochondrial genomes do not seem to be the result of replication-associated mutational bias, but rather the non-random distribution of the protein coding genes, as has been observed in some other genomes <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. Nevertheless, based on the presence of other features such as stem-loop structures, it seems reasonable to assume that the repeat region in both cryptophyte mitochondrial genomes corresponds to the origin of replication.</p>
         </sec>
         <sec>
            <st>
               <p>Codon usage and transfer RNAs</p>
            </st>
            <p>The <it>H. andersenii </it>mtDNA encodes 28 tRNAs, 27 of which are predicted to form standard cloverleaf secondary structures. One tRNA gene, '<it>trnK(uuu)</it>', shows atypical structure in the anticodon loop and the variable region, and is probably a pseudogene (Figure <figr fid="F4">4A</figr>). Allowing for wobble pairings and some base modifications, 26 tRNAs are the theoretical minimum required to cover all codons in bacteria. For some mitochondria, even smaller sets of tRNAs, as few as 22&#8211;23, are possible by adopting several additional strategies <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. The <it>H. andersenii </it>mitochondrial genome lacks only one tRNA gene, <it>trnK(uuu)</it>, which is minimally required in order to recognize all 61 codons (Table <tblr tid="T3">3</tblr>). It is thus predicted that nuclear-encoded cytosolic Lys-tRNA is imported into <it>H. andersenii </it>mitochondria. Mitochondrial tRNA import has been demonstrated in apicomplexans and trypanosomatids where tRNA genes are completely missing in their mitochondrial genomes <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>, as well as in ciliates and plants where mitochondrial genomes encode fewer than the 22&#8211;23 minimally required tRNA genes <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. Although most animals and some fungi do not import tRNAs into mitochondria <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>, the fungus <it>Saccharomyces cerevisiae </it>has been shown to import one specific cytosolic tRNA even though its mitochondrial genome encodes the full complement of tRNAs <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Analyses of the tRNA repertoire of mitochondrial genomes suggest that a number of other protist taxa across the eukaryotic tree also import one or more tRNAs into their mitochondria <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B45">45</abbr></abbrgrp>. It is thus reasonable to assume that <it>H. andersenii </it>imports at least Lys-tRNA, although it is possible that tRNA editing makes up for the Lys-tRNA deficit by changing the identity of an existing tRNA, as has been shown in marsupials <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Predicted secondary structures for three homologous cryptophyte tRNAs</p>
               </caption>
               <text>
                  <p><b>Predicted secondary structures for three homologous cryptophyte tRNAs.</b><it>Hemiselmis andersenii </it>'<it>trnK(uuu)' </it>(A) is paralogous to <it>trnS(gcu) </it>(B). Duplication and divergence of an ancestral <it>trnS(gcu) </it>appears to have led to the evolution of '<it>trnK(uuu)</it>' in <it>H. andersenii </it>(A), which consequently, possesses an atypically long variable region. The anticodon loop and the variable region of '<it>trnK(uuu)</it>' is AT-rich and the predicted stem regions consist entirely of A-T base pairings. Furthermore, the loop of the anticodon loop/stem region is missing one nucleotide (arrow). These structural considerations suggest that '<it>trnK(uuu)</it>' may not be functional. The '<it>trnK(uuu)' </it>gene in <it>H. andersenii </it>(A) is orthologous to <it>trnS(gcu) </it>in <it>R. salina </it>(C). Note that while sequences within the anticodon loop, V-arm, and acceptor stem are divergent between the two orthologous copies, sequences within the D loop and the T loop are conserved.</p>
               </text>
               <graphic file="1471-2164-9-215-4"/>
            </fig>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p><it>Hemiselmis andersenii </it>mtDNA codon usage table.</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="4" ca="center">
                        <p>Second Position of Codon</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="6">
                        <hr/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>U</p>
                     </c>
                     <c ca="center">
                        <p>C</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>G</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="6">
                        <hr/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>First Position of Codon</p>
                     </c>
                     <c ca="center">
                        <p>U</p>
                     </c>
                     <c ca="left">
                        <p>UUU [F] 608</p>
                     </c>
                     <c ca="left">
                        <p>UCU [S] 123</p>
                     </c>
                     <c ca="left">
                        <p>UAU [Y] 312</p>
                     </c>
                     <c ca="left">
                        <p>UGU [C] 123</p>
                     </c>
                     <c ca="left">
                        <p>U</p>
                     </c>
                     <c ca="left">
                        <p>Third Position of Codon</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>UUC [F] 126&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>UCC [S] 18</p>
                     </c>
                     <c ca="left">
                        <p>UAC [Y] 69&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>UGC [C] 20&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>C</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>UUA [L] 688&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>UCA [S] 205&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>UAA [stop] 32</p>
                     </c>
                     <c ca="left">
                        <p>UGA [stop] 2</p>
                     </c>
                     <c ca="left">
                        <p>A</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>UUG [L] 163</p>
                     </c>
                     <c ca="left">
                        <p>UCG [S] 57</p>
                     </c>
                     <c ca="left">
                        <p>UAG [stop] 3</p>
                     </c>
                     <c ca="left">
                        <p>UGG [W] 120&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>G</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="6">
                        <hr/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>C</p>
                     </c>
                     <c ca="left">
                        <p>CUU [L] 179</p>
                     </c>
                     <c ca="left">
                        <p>CCU [P] 139</p>
                     </c>
                     <c ca="left">
                        <p>CAU [H] 140</p>
                     </c>
                     <c ca="left">
                        <p>CGU [R] 83</p>
                     </c>
                     <c ca="left">
                        <p>U</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>CUC [L] 38&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>CCC [P] 18</p>
                     </c>
                     <c ca="left">
                        <p>CAC [H] 30&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>CGC [R] 17&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>C</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>CUA [L] 88&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>CCA [P] 115&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>CAA [Q] 217&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>CGA [R] 95&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>A</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>CUG [L] 20</p>
                     </c>
                     <c ca="left">
                        <p>CCG [P] 28</p>
                     </c>
                     <c ca="left">
                        <p>CAG [Q] 42</p>
                     </c>
                     <c ca="left">
                        <p>CGG [R] 32</p>
                     </c>
                     <c ca="left">
                        <p>G</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="6">
                        <hr/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="left">
                        <p>AUU [I] 579</p>
                     </c>
                     <c ca="left">
                        <p>ACU [T] 162</p>
                     </c>
                     <c ca="left">
                        <p>AAU [N] 307</p>
                     </c>
                     <c ca="left">
                        <p>AGU [S] 183</p>
                     </c>
                     <c ca="left">
                        <p>U</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>AUC [I] 85&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>ACC [T] 24</p>
                     </c>
                     <c ca="left">
                        <p>AAC [N] 95&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>AGC [S] 29&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>C</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>AUA [I] 207&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>ACA [T] 242&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>AAA [K] 517&#8226; &#8224;</p>
                     </c>
                     <c ca="left">
                        <p>AGA [R] 110&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>A</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>AUG [M] 233&#8226;&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>ACG [T] 54</p>
                     </c>
                     <c ca="left">
                        <p>AAG [K] 75</p>
                     </c>
                     <c ca="left">
                        <p>AGG [R] 15</p>
                     </c>
                     <c ca="left">
                        <p>G</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="6">
                        <hr/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>G</p>
                     </c>
                     <c ca="left">
                        <p>GUU [V] 311</p>
                     </c>
                     <c ca="left">
                        <p>GCU [A] 186</p>
                     </c>
                     <c ca="left">
                        <p>GAU [D] 221</p>
                     </c>
                     <c ca="left">
                        <p>GGU [G] 299</p>
                     </c>
                     <c ca="left">
                        <p>U</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>GUC [V] 38</p>
                     </c>
                     <c ca="left">
                        <p>GCC [A] 35</p>
                     </c>
                     <c ca="left">
                        <p>GAC [D] 39&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>GGC [G] 41&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>C</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>GUA [V] 204&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>GCA [A] 225&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>GAA [E] 283&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>GGA [G] 131&#8226;</p>
                     </c>
                     <c ca="left">
                        <p>A</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>GUG [V] 62</p>
                     </c>
                     <c ca="left">
                        <p>GCG [A] 64</p>
                     </c>
                     <c ca="left">
                        <p>GAG [E] 65</p>
                     </c>
                     <c ca="left">
                        <p>GGG [G] 83</p>
                     </c>
                     <c ca="left">
                        <p>G</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The frequency of each codon is shown and dots indicate the presence of a corresponding tRNA gene in the mtDNA. &#8224;: '<it>trnK(uuu)</it>' identified in the genome may be a pseudogene (see discussion).</p>
               </tblfn>
            </tbl>
            <p>Another possible mechanism to account for the missing tRNA is that the structurally abnormal '<it>trnK(uuu)</it>' gene (Figure <figr fid="F4">4A</figr>) forms a functional Lys-tRNA to decode the codons AAA and AAG. Several cases of atypically-structured tRNAs are known from animal and ciliate mitochondria <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp>. Interestingly, tRNAscan-SE <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> predicted the existence of a 20 bp intron within the <it>H. andersenii </it>'<it>trnK(uuu)'</it>, and we conducted further experiments to test whether this is indeed the case. RT-PCR experiments using primer sets specific for '<it>trnK(uuu)</it>' indicated that the putative intron was not removed in the mature tRNA. This results is not unexpected, given that the 20-bp putative intron is too short to be a self-splicing group I or II intron, which are the only known types of introns reported in mitochondrial genomes <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. Sequencing of ~20 clones also did not reveal any evidence for RNA editing within the '<it>trnK(uuu)</it>'. These results suggest that if '<it>trnK(uuu)</it>' is indeed expressed to form a functional Lys-tRNA, it is predicted to have an unusually AU-rich stem in the codon loop and a long variable region, atypical for Lys-tRNA (Figure <figr fid="F4">4A</figr>). Long variable regions ranging from 11 to 23 nucleotides are generally restricted to tRNA-Leu, tRNA-Ser, and bacterial tRNA-Tyr <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. The D- and T-loops of the '<it>trnK(uuu)</it>' sequence show sequence similarity to one of the two mitochondrion-encoded tRNA-Ser genes (Figure <figr fid="F4">4A</figr> and <figr fid="F4">4B</figr>), both of which have a long variable region. In addition, comparative analysis with the <it>R. salina </it>mtDNA revealed genomic position conservation between the <it>H. andersenii trnS</it>-like <it>'trnK(uuu)' </it>gene and the <it>trnS(gcu) </it>gene of <it>R. salina</it>, flanked by the <it>tatC </it>and <it>nad7 </it>genes. The <it>H. andersenii </it>'<it>trnK(uuu)</it>' and <it>R. salina trnS(gcu) </it>genes both overlap <it>tatC </it>by 51 bp and 22 bp, respectively. This strongly suggests that the <it>H. andersenii 'trnK(uuu)' </it>is indeed derived from an ancestral gene that encoded tRNA-Ser, explaining the origin of its long variable region. The overlap between the <it>H. andersenii </it>'<it>trnK(uuu)</it>' and <it>tatC </it>suggests that '<it>trnK(uuu)</it>' may play a role in processing the 3' end of the <it>tatC </it>gene transcript. This hypothesis could explain why the '<it>trnK(uuu)</it>' gene still remains in the genome and retains conserved secondary structure in the stem loop and D- and T-loops, even if it does not form a functional tRNA. Comprehensive molecular and biochemical experimentation will be necessary to confirm or refute the existence of mitochondrial tRNA import in <it>H. andersenii </it>and the functionality of the unusual '<it>trnK(uuu)</it>' gene.</p>
            <p>When the <it>H. andersenii </it>tRNA genes were compared to those of <it>R. salina</it>, 24 homologous pairs of tRNAs were identified, leaving only four <it>H. andersenii </it>tRNA and three <it>R. salina </it>tRNA genes not unambiguously matched to each other. Each of the tRNA pairs possess identical anticodons except for the <it>H. andersenii 'trnK(uuu)' </it>and <it>R. salina trnS(gcu) </it>pair, despite their common derivation. The <it>trnS(gcu) </it>of <it>H. andersenii</it>, having sequence homology to the '<it>trnK(uuu)</it>', probably originated from a recent gene duplication event. Of the three remaining <it>H. andersenii </it>tRNA genes that are unmatched in <it>R. salina</it>, two&#8211;<it>trnL(gag) </it>and <it>trnG(gcc)</it>&#8211;are redundant because <it>trnL(uag) </it>and <it>trnG(ucc) </it>can decode all of their respective four-codon families <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. These redundant copies might have been lost in an ancestor of <it>R. salina </it>after it diverged from <it>H. andersenii</it>. Lastly, the <it>H. andersenii trnI(cau) </it>is somewhat similar to the <it>trnK(uuu) </it>of the <it>R. salina </it>and only marginally resembles the <it>R. salina trnI(cau) </it>at the 3' end. It is possible that the <it>H. andersenii trnI(cau) </it>originated through recombination between ancestral <it>trnI(cau) </it>and <it>trnK(uuu) </it>genes, which would explain the lack of an obvious <it>trnK(uuu) </it>homolog in <it>H. andersenii </it>comparable to the <it>R. salina trnK(uuu)</it>. Substantial sequence divergence among the three genes, however, makes it difficult to accurately trace the origin of the <it>trnI(cau) </it>and the loss of the original <it>trnK(uuu) </it>gene in <it>H. andersenii</it>. On the other hand, the unusual <it>trnI(uau) </it>gene reported from <it>R. salina </it>is not found in <it>H. andersenii</it>. It was suggested that the <it>R. salina trnI(uau) </it>is derived from <it>trnF(uuc) </it>through a recent gene duplication event <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Overall, the two cryptophyte mitochondrial genomes use similar tRNA sets to recognize codons. However, unlike <it>H. andersenii</it>, which may need to import at least <it>trnK(uuu) </it>from cytosol, the <it>R. salina </it>mtDNA does possess the minimal required set for tRNA autonomy.</p>
         </sec>
         <sec>
            <st>
               <p>Molecular phylogenetic analyses</p>
            </st>
            <p>Cryptophytes are a well-established eukaryotic lineage, supported by both molecular and morphological features <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. However, their relationship to other eukaryotic groups, particularly those containing plastids of secondary endosymbiotic origin, has been the subject of considerable debate. The cryptophyte plastid is the product of a secondary endosymbiosis involving a red algal cell, the same process which accounts for plastid origins in haptophytes, dinoflagellates, and stramenopiles <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. Cavalier-Smith <abbrgrp><abbr bid="B50">50</abbr></abbrgrp> suggested that plastids in these four algal lineages arose from a single secondary endosymbiosis in a common ancestor that these organisms shared, to the exclusion of other eukaryotic groups. However, this "chromalveolate" hypothesis is controversial <abbrgrp><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr></abbrgrp>. Recent molecular studies have shown that the katablepharids, an enigmatic collection of plastid-less flagellates, are a sister group to cryptophytes <abbrgrp><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr></abbrgrp>, and large-scale concatenated analyses of nuclear genes suggest that cryptophytes and haptophytes are also related <abbrgrp><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr></abbrgrp>.</p>
            <p>To gain insight into the phylogenetic relationship of the cryptophytes <it>H. andersenii </it>and <it>R. salina </it>to other eukaryotes, and more specifically, to test the hypothesis that cryptophytes and haptophytes are related to one another, phylogenetic analyses of mitochondrial protein sequences were performed (Figure <figr fid="F5">5</figr>). Unlike the cryptophyte plastid genome, in which several cases of LGT have recently been discovered <abbrgrp><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr></abbrgrp>, individual analyses of 25 mitochondrial proteins did not reveal any obvious instances of LGT between prokaryotes and eukaryotes or within eukaryotes (data not shown). However, the possibility of ancient LGTs cannot be ruled out, as the backbones of individual protein phylogenies were generally very poorly supported.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Maximum likelihood phylogenetic tree based on 25 concatenated mitochondrial protein sequences inferred using RAxML</p>
               </caption>
               <text>
                  <p><b>Maximum likelihood phylogenetic tree based on 25 concatenated mitochondrial protein sequences inferred using RAxML.</b> Approximately 15% of data are missing. Bootstrap support values over 50% are indicated at the corresponding nodes.</p>
               </text>
               <graphic file="1471-2164-9-215-5"/>
            </fig>
            <p>As expected, a close relationship between the two cryptophytes <it>H. andersenii </it>and <it>R. salina </it>was well supported in the mitochondrial protein phylogenies, with twenty of twenty-five individual protein phylogenies showing this relationship. Five individual gene phylogenies&#8211;<it>nad2, rpl14, rpl16, rps12, rps14</it>&#8211;did not recover a <it>H. andersenii</it>-<it>R. salina </it>clade, although alternative topologies were not supported with >50% bootstrap support values. Additionally, single protein phylogenies were not, for the most part, able to resolve the relationship of cryptophytes to other eukaryotes. The position of cryptophytes was highly variable from protein to protein and the group did not regularly associate with other taxonomic clades with >50% bootstrap support values, except for in the <it>cob </it>and <it>nad1 </it>gene trees, where cryptophytes branch with haptophytes (81%) and jakobids (77%), respectively.</p>
            <p>We subsequently analyzed a set of 25 concatenated proteins to assess the phylogenetic position of cryptophytes. In this analysis, the <it>H. andersenii</it>-<it>R. salina </it>clade received 100% bootstrap support (Figure <figr fid="F5">5</figr>). Other well-established eukaryotic groups including opisthokonts, rhodophytes, stramenopiles, and Viridiplantae, were also strongly recovered, but the relationships among major lineages were not. The jakobid <it>Reclinomonas </it>branched as the sister group to the Viridiplantae with moderate support (89% bootstrap support), and <it>Malawimonas </it>showed an affinity for these two groups in two of the three data sets, as was previously inferred from a concatenate of ten mitochondrial proteins <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>. It is not clear whether the jabokid (and/or malawimonad)-Viridiplantae affinity is a phylogenetic artifact or reflects the true evolutionary history of mitochondrial genes. Though growing evidence supports a relationship between cryptophytes and haptophytes <abbrgrp><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr><abbr bid="B58">58</abbr></abbrgrp>, our extensive mitochondrial protein analyses did not reveal this relationship with reasonable bootstrap support, other than in a single protein gene tree (<it>cob)</it>. In summary, while mitochondrial gene sequences are able to resolve some of the eukaryotic lineages determined using other markers, they are at present incapable of resolve the deepest branches of the eukaryotic tree using current phylogenetic methods and with the present level of taxon sampling.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We have sequenced the mitochondrial genome of the cryptophyte <it>H. andersenii </it>and compared it to that of the distantly related cryptophyte <it>R. salina</it>. Our analyses reveal that both genomes are characterized by a gene dense region and a single large intergenic space that includes numerous repeats and palindromic sequences predicted to form stable DNA stem and loop structures. Despite the overall similarities in content and architecture between the two genomes, their modes of regulating DNA replication and transcription seem to differ. Unlike <it>R. salina</it>, all 73 genes in the <it>H. andersenii </it>mtDNA are located on the same strand, a relatively rare observation in mitochondrial genomes. Phylogenic analysis of multiple mitochondrial gene sequences indicated a clear affiliation between the two cryptophytes but was not able to resolve the position of cryptophytes relative to other eukaryotic groups.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>EK participated in genome assembly, carried out genome analysis and drafted the manuscript. CEL isolated <it>H. andersenii </it>DNA and participated in the initial genome assembly. BAC, CK, and SB performed the <it>H. andersenii </it>mitochondrial genome sequencing. JMA coordinated the study and helped draft the manuscript. All authors read and approved the manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank D. Spencer for discussion, J. Leigh for mitochondrial protein sequence alignments, H. Khan for <it>H. andersenii </it>RNA, A. Roger for help with phylogenetic analyses, and D. Spencer and H. Khan for helpful comments on the manuscript. A. Bendich is acknowledged for providing insight on the probable <it>in vivo </it>structure of <it>H. andersenii </it>mtDNA. This work was supported by Genome Atlantic and a Natural Sciences and Engineering Research Council of Canada Discovery Grant (28335-04) awarded to JMA. EK receives postdoctoral fellowship support from the Tula Foundation. JMA is a Scholar of the Canadian Institute for Advanced Research, Program in Integrated Microbial Biodiversity.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Mitochondrial genomes: anything goes</p>
            </title>
            <aug>
               <au>
                  <snm>Burger</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gray</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Lang</snm>
                  <fnm>BF</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>12</issue>
            <fpage>709</fpage>
            <lpage>716</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2003.10.012</pubid>
                  <pubid idtype="pmpid" link="fulltext">14642752</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Mitochondrial remnant organelles of Giardia function in iron-sulphur protein maturation</p>
            </title>
            <aug>
               <au>
                  <snm>Tovar</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Leon-Avila</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Sanchez</snm>
                  <fnm>LB</fnm>
               </au>
               <au>
                  <snm>Sutak</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tachezy</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>van der Giezen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hernandez</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Muller</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lucocq</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>426</volume>
            <issue>6963</issue>
            <fpage>172</fpage>
            <lpage>176</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01945</pubid>
                  <pubid idtype="pmpid" link="fulltext">14614504</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Mitochondrial DNA as a genomic jigsaw puzzle</p>
            </title>
            <aug>
               <au>
                  <snm>Marande</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Burger</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2007</pubdate>
            <volume>318</volume>
            <issue>5849</issue>
            <fpage>415</fpage>
            <lpage>415</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1148033</pubid>
                  <pubid idtype="pmpid" link="fulltext">17947575</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>The highly reduced and fragmented mitochondrial genome of the early-branching dinoflagellate Oxyrrhis marina shares characteristics with both apicomplexan and dinoflagellate mitochondrial genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Slamovits</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Saidarriaga</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Larocque</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2007</pubdate>
            <volume>372</volume>
            <issue>2</issue>
            <fpage>356</fpage>
            <lpage>368</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jmb.2007.06.085</pubid>
                  <pubid idtype="pmpid" link="fulltext">17655860</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Mitochondrial genome diversity: evolution of the molecular architecture and replication strategy</p>
            </title>
            <aug>
               <au>
                  <snm>Nosek</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Tomaska</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Curr Genet</source>
            <pubdate>2003</pubdate>
            <volume>44</volume>
            <issue>2</issue>
            <fpage>73</fpage>
            <lpage>84</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00294-003-0426-z</pubid>
                  <pubid idtype="pmpid" link="fulltext">12898180</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Complete sequence of the mitochondrial genome of Tetrahymena pyriformis and comparison with Paramecium aurelia mitochondrial DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Burger</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Littlejohn</snm>
                  <fnm>TG</fnm>
               </au>
               <au>
                  <snm>Greenwood</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Schnare</snm>
                  <fnm>MN</fnm>
               </au>
               <au>
                  <snm>Lang</snm>
                  <fnm>BF</fnm>
               </au>
               <au>
                  <snm>Gray</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2000</pubdate>
            <volume>297</volume>
            <issue>2</issue>
            <fpage>365</fpage>
            <lpage>380</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.3529</pubid>
                  <pubid idtype="pmpid" link="fulltext">10715207</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Mitochondrial DNA of Chlamydomonas reinhardtii: the structure of the ends of the linear 15.8 Kb genome suggests mechanisms for DNA replication</p>
            </title>
            <aug>
               <au>
                  <snm>Vahrenholz</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Riemen</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Pratje</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Dujon</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Michaelis</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Curr Genet</source>
            <pubdate>1993</pubdate>
            <volume>24</volume>
            <issue>3</issue>
            <fpage>241</fpage>
            <lpage>247</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00351798</pubid>
                  <pubid idtype="pmpid">8221933</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Mitochondrial genome of the moon jelly Aurelia aurita (Cnidaria, Scyphozoa): A linear DNA molecule encoding a putative DNA-dependent DNA polymerase</p>
            </title>
            <aug>
               <au>
                  <snm>Shao</snm>
                  <fnm>ZY</fnm>
               </au>
               <au>
                  <snm>Graf</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chaga</snm>
                  <fnm>OY</fnm>
               </au>
               <au>
                  <snm>Lavrov</snm>
                  <fnm>DV</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2006</pubdate>
            <volume>381</volume>
            <fpage>92</fpage>
            <lpage>101</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.gene.2006.06.021</pubid>
                  <pubid idtype="pmpid" link="fulltext">16945488</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Reaching for the ring: the study of mitochondrial genome structure</p>
            </title>
            <aug>
               <au>
                  <snm>Bendich</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Curr Genet</source>
            <pubdate>1993</pubdate>
            <volume>24</volume>
            <issue>4</issue>
            <fpage>279</fpage>
            <lpage>290</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00336777</pubid>
                  <pubid idtype="pmpid">8252636</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Structural analysis of mitochondrial DNA molecules from fungi and plants using moving pictures and pulsed-field gel electrophoresis</p>
            </title>
            <aug>
               <au>
                  <snm>Bendich</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1996</pubdate>
            <volume>255</volume>
            <issue>4</issue>
            <fpage>564</fpage>
            <lpage>588</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1996.0048</pubid>
                  <pubid idtype="pmpid" link="fulltext">8568898</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Unique mitochondrial genome structure in diplonemids, the sister group of kinetoplastids</p>
            </title>
            <aug>
               <au>
                  <snm>Marande</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lukes</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Burger</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Eukaryot Cell</source>
            <pubdate>2005</pubdate>
            <volume>4</volume>
            <issue>12</issue>
            <fpage>2170</fpage>
            <lpage>2170</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1128/EC.4.12.2170.2005</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>An ancestral mitochondrial DNA resembling a eubacterial genome in miniature</p>
            </title>
            <aug>
               <au>
                  <snm>Lang</snm>
                  <fnm>BF</fnm>
               </au>
               <au>
                  <snm>Burger</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>O'Kelly</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Cedergren</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Golding</snm>
                  <fnm>GB</fnm>
               </au>
               <au>
                  <snm>Lemieux</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Sankoff</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Turmel</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gray</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1997</pubdate>
            <volume>387</volume>
            <issue>6632</issue>
            <fpage>493</fpage>
            <lpage>497</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/387493a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">9168110</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The Plasmodium falciparum 6 kb element is polycistronically transcribed</p>
            </title>
            <aug>
               <au>
                  <snm>Ji</snm>
                  <fnm>YE</fnm>
               </au>
               <au>
                  <snm>Mericle</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Rehkopf</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Feagin</snm>
                  <fnm>JE</fnm>
               </au>
            </aug>
            <source>Mol Biochem Parasitol</source>
            <pubdate>1996</pubdate>
            <volume>81</volume>
            <issue>2</issue>
            <fpage>211</fpage>
            <lpage>223</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0166-6851(96)02712-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">8898336</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Failure to detect DNA in hydrogenosomes of Trichomonas vaginalis by nick translation and immunomicroscopy</p>
            </title>
            <aug>
               <au>
                  <snm>Clemens</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Mol Biochem Parasitol</source>
            <pubdate>2000</pubdate>
            <volume>106</volume>
            <issue>2</issue>
            <fpage>307</fpage>
            <lpage>313</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0166-6851(99)00220-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">10699261</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Eukaryotic evolution, changes and challenges</p>
            </title>
            <aug>
               <au>
                  <snm>Embley</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2006</pubdate>
            <volume>440</volume>
            <issue>7084</issue>
            <fpage>623</fpage>
            <lpage>630</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04546</pubid>
                  <pubid idtype="pmpid" link="fulltext">16572163</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Evolution of RNA editing in kinetoplastid protozoa</p>
            </title>
            <aug>
               <au>
                  <snm>Maslov</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Avila</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Lake</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Simpson</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1994</pubdate>
            <volume>368</volume>
            <issue>6469</issue>
            <fpage>345</fpage>
            <lpage>348</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/368345a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">8127370</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>The mitochondrial DNA of land plants: peculiarities in phylogenetic perspective</p>
            </title>
            <aug>
               <au>
                  <snm>Knoop</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Curr Genet</source>
            <pubdate>2004</pubdate>
            <volume>46</volume>
            <issue>3</issue>
            <fpage>123</fpage>
            <lpage>139</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00294-004-0522-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">15300404</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Mitochondrial cytochrome b mRNA editing in dinoflagellates: possible ecological and evolutionary associations?</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Eukaryot Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>52</volume>
            <issue>6</issue>
            <fpage>538</fpage>
            <lpage>545</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1550-7408.2005.00060.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">16313447</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Widespread and extensive editing of mitochondrial mRNAs in dinoflagellates</p>
            </title>
            <aug>
               <au>
                  <snm>Lin</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Spencer</snm>
                  <fnm>DF</fnm>
               </au>
               <au>
                  <snm>Norman</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Gray</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2002</pubdate>
            <volume>320</volume>
            <issue>4</issue>
            <fpage>727</fpage>
            <lpage>739</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0022-2836(02)00468-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">12095251</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Algae</p>
            </title>
            <aug>
               <au>
                  <snm>Graham</snm>
                  <fnm>LE</fnm>
               </au>
               <au>
                  <snm>Wilcox</snm>
                  <fnm>LW</fnm>
               </au>
            </aug>
            <publisher>Upper Saddle River, NJ, Prentice Hall</publisher>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B21">
            <title>
               <p>The new higher level classification of eukaryotes with emphasis on the taxonomy of protists</p>
            </title>
            <aug>
               <au>
                  <snm>Adl</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Simpson</snm>
                  <fnm>AGB</fnm>
               </au>
               <au>
                  <snm>Farmer</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Andersen</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>OR</fnm>
               </au>
               <au>
                  <snm>Barta</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Bowser</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Brugerolle</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Fensome</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Fredericq</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>James</snm>
                  <fnm>TY</fnm>
               </au>
               <au>
                  <snm>Karpov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kugrens</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Krug</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lane</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Lodge</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lynn</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Mann</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>McCourt</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Mendoza</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Moestrup</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Mozley-Standridge</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Nerad</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Shearer</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Smirnov</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Spiegel</snm>
                  <fnm>FW</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>MFJR</fnm>
               </au>
            </aug>
            <source>J Eukaryot Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>52</volume>
            <issue>5</issue>
            <fpage>399</fpage>
            <lpage>451</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1550-7408.2005.00053.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">16248873</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Nucleomorph genomes: structure, function, origin and evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Archibald</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Bioessays</source>
            <pubdate>2007</pubdate>
            <volume>29</volume>
            <issue>4</issue>
            <fpage>392</fpage>
            <lpage>402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/bies.20551</pubid>
                  <pubid idtype="pmpid" link="fulltext">17373660</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>The highly reduced genome of an enslaved algal nucleus</p>
            </title>
            <aug>
               <au>
                  <snm>Douglas</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zauner</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Fraunholz</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Beaton</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Penny</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>LT</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>XN</fnm>
               </au>
               <au>
                  <snm>Reith</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cavalier-Smith</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Maier</snm>
                  <fnm>UG</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>410</volume>
            <issue>6832</issue>
            <fpage>1091</fpage>
            <lpage>1096</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35074092</pubid>
                  <pubid idtype="pmpid" link="fulltext">11323671</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The Rhodomonas salina mitochondrial genome: bacteria-like operons, compact gene arrangement and complex repeat region</p>
            </title>
            <aug>
               <au>
                  <snm>Hauth</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Maier</snm>
                  <fnm>UG</fnm>
               </au>
               <au>
                  <snm>Lang</snm>
                  <fnm>BF</fnm>
               </au>
               <au>
                  <snm>Burger</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <issue>14</issue>
            <fpage>4433</fpage>
            <lpage>4442</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1183108</pubid>
                  <pubid idtype="pmpid" link="fulltext">16085754</pubid>
                  <pubid idtype="doi">10.1093/nar/gki757</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Nucleomorph genome of Hemiselmis andersenii reveals complete intron loss and compaction as a driver of protein structure and function.</p>
            </title>
            <aug>
               <au>
                  <snm>Lane</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>van den Heuvel</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Korera</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Curtis</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Parsons</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Bowman</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Archibald</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <fpage>19908</fpage>
            <lpage>19913</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0707419104</pubid>
                  <pubid idtype="pmpid" link="fulltext">18077423</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>The Staden package, 1998</p>
            </title>
            <aug>
               <au>
                  <snm>Staden</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Beal</snm>
                  <fnm>KF</fnm>
               </au>
               <au>
                  <snm>Bonfield</snm>
                  <fnm>JK</fnm>
               </au>
            </aug>
            <source>Methods Mol Biol</source>
            <pubdate>2000</pubdate>
            <volume>132</volume>
            <fpage>115</fpage>
            <lpage>130</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10547834</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>MacClade 4: analysis of phylogeny and character evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Maddison</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Maddison</snm>
                  <fnm>WP</fnm>
               </au>
            </aug>
            <publisher>Sunderland, MA, Sinauer Associates Inc.</publisher>
            <pubdate>2001</pubdate>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Mfold web server for nucleic acid folding and hybridization prediction</p>
            </title>
            <aug>
               <au>
                  <snm>Zuker</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>13</issue>
            <fpage>3406</fpage>
            <lpage>3415</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">169194</pubid>
                  <pubid idtype="pmpid" link="fulltext">12824337</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg595</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Insight into the diversity and evolution of the cryptomonad nucleomorph genome</p>
            </title>
            <aug>
               <au>
                  <snm>Lane</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Khan</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>MacKinnon</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fong</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Theophilou</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Archibald</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2006</pubdate>
            <volume>23</volume>
            <issue>9</issue>
            <fpage>1817</fpage>
            <lpage>1817</lpage>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Novel nucleomorph genome architecture in the cryptomonad genus Hemiselmis</p>
            </title>
            <aug>
               <au>
                  <snm>Lane</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Archibald</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>J Eukaryot Microbiol</source>
            <pubdate>2006</pubdate>
            <volume>53</volume>
            <issue>6</issue>
            <fpage>515</fpage>
            <lpage>521</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1550-7408.2006.00135.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">17123416</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Artemis: sequence visualization and annotation</p>
            </title>
            <aug>
               <au>
                  <snm>Rutherford</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Parkhill</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Crook</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Horsnell</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Rice</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Rajandream</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <issue>10</issue>
            <fpage>944</fpage>
            <lpage>945</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/16.10.944</pubid>
                  <pubid idtype="pmpid" link="fulltext">11120685</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence</p>
            </title>
            <aug>
               <au>
                  <snm>Lowe</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <issue>5</issue>
            <fpage>955</fpage>
            <lpage>964</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146525</pubid>
                  <pubid idtype="pmpid" link="fulltext">9023104</pubid>
                  <pubid idtype="doi">10.1093/nar/25.5.955</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>GRIMM: genome rearrangements web server</p>
            </title>
            <aug>
               <au>
                  <snm>Tesler</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <issue>3</issue>
            <fpage>492</fpage>
            <lpage>493</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/18.3.492</pubid>
                  <pubid idtype="pmpid" link="fulltext">11934753</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models</p>
            </title>
            <aug>
               <au>
                  <snm>Stamatakis</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>21</issue>
            <fpage>2688</fpage>
            <lpage>2690</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btl446</pubid>
                  <pubid idtype="pmpid" link="fulltext">16928733</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>The mitochondria DNA of the ameboid protozoan, Acanthamoeba castellanii: complete sequence, gene content and genome organization</p>
            </title>
            <aug>
               <au>
                  <snm>Burger</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Plante</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Lonergan</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Gray</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1995</pubdate>
            <volume>245</volume>
            <issue>5</issue>
            <fpage>522</fpage>
            <lpage>537</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1994.0043</pubid>
                  <pubid idtype="pmpid" link="fulltext">7844823</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>The mitochondrial genome of the thermal dimorphic fungus Penicillium marneffei is more closely related to those of molds than yeasts</p>
            </title>
            <aug>
               <au>
                  <snm>Woo</snm>
                  <fnm>PCY</fnm>
               </au>
               <au>
                  <snm>Zhen</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Cai</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lau</snm>
                  <fnm>SKP</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Teng</snm>
                  <fnm>JLL</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>SSY</fnm>
               </au>
               <au>
                  <snm>Tse</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Yuen</snm>
                  <fnm>KY</fnm>
               </au>
            </aug>
            <source>FEBS Lett</source>
            <pubdate>2003</pubdate>
            <volume>555</volume>
            <issue>3</issue>
            <fpage>469</fpage>
            <lpage>477</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0014-5793(03)01307-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">14675758</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Comparative structure and genomic organization of the discontinuous mitochondrial ribosomal RNA genes of Chlamydomonas eugametos and Chlamydomonas reinhardtii</p>
            </title>
            <aug>
               <au>
                  <snm>Denovanwright</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>RW</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1994</pubdate>
            <volume>241</volume>
            <issue>2</issue>
            <fpage>298</fpage>
            <lpage>311</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1994.1505</pubid>
                  <pubid idtype="pmpid" link="fulltext">7520083</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>The complete mitochondrial DNA sequences of Nephroselmis olivacea and Pedinomonas minor. Two radically different evolutionary patterns within green algae</p>
            </title>
            <aug>
               <au>
                  <snm>Turmel</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lemieux</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Burger</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lang</snm>
                  <fnm>BF</fnm>
               </au>
               <au>
                  <snm>Otis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Plante</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Gray</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>1999</pubdate>
            <volume>11</volume>
            <issue>9</issue>
            <fpage>1717</fpage>
            <lpage>1730</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">144307</pubid>
                  <pubid idtype="pmpid" link="fulltext">10488238</pubid>
                  <pubid idtype="doi">10.1105/tpc.11.9.1717</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Base composition skews, replication orientation, and gene orientation in 12 prokaryote genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Mclean</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Devine</snm>
                  <fnm>KM</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1998</pubdate>
            <volume>47</volume>
            <issue>6</issue>
            <fpage>691</fpage>
            <lpage>696</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/PL00006428</pubid>
                  <pubid idtype="pmpid" link="fulltext">9847411</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>tRNomics: analysis of tRNA genes from 50 genomes of Eukarya, Archaea, and Bacteria reveals anticodon-sparing strategies and domain-specific features</p>
            </title>
            <aug>
               <au>
                  <snm>Marck</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Grosjean</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>RNA</source>
            <pubdate>2002</pubdate>
            <volume>8</volume>
            <issue>10</issue>
            <fpage>1189</fpage>
            <lpage>1232</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1370332</pubid>
                  <pubid idtype="pmpid" link="fulltext">12403461</pubid>
                  <pubid idtype="doi">10.1017/S1355838202022021</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Mitochondrial tRNA import in Toxoplasma gondii</p>
            </title>
            <aug>
               <au>
                  <snm>Esseiva</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Naguleswaran</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hemphill</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schneider</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2004</pubdate>
            <volume>279</volume>
            <issue>41</issue>
            <fpage>42363</fpage>
            <lpage>42368</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M404519200</pubid>
                  <pubid idtype="pmpid" link="fulltext">15280394</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Identification and structural characterization of nucleus-encoded transfer RNAs imported into wheat mitochondria</p>
            </title>
            <aug>
               <au>
                  <snm>Glover</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Spencer</snm>
                  <fnm>DF</fnm>
               </au>
               <au>
                  <snm>Gray</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2001</pubdate>
            <volume>276</volume>
            <issue>1</issue>
            <fpage>639</fpage>
            <lpage>648</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M007708200</pubid>
                  <pubid idtype="pmpid" link="fulltext">11027690</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Mitochondrial tRNA import: are there distinct mechanisms?</p>
            </title>
            <aug>
               <au>
                  <snm>Schneider</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Marechal-Drouard</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Trends Cell Biol</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <issue>12</issue>
            <fpage>509</fpage>
            <lpage>513</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0962-8924(00)01854-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">11121736</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>tRNA transfers to the limelight</p>
            </title>
            <aug>
               <au>
                  <snm>Hopper</snm>
                  <fnm>AK</fnm>
               </au>
               <au>
                  <snm>Phizicky</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2003</pubdate>
            <volume>17</volume>
            <issue>2</issue>
            <fpage>162</fpage>
            <lpage>180</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gad.1049103</pubid>
                  <pubid idtype="pmpid" link="fulltext">12533506</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Genome structure and gene content in protist mitochondrial DNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Gray</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Lang</snm>
                  <fnm>BF</fnm>
               </au>
               <au>
                  <snm>Cedergren</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Golding</snm>
                  <fnm>GB</fnm>
               </au>
               <au>
                  <snm>Lemieux</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Sankoff</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Turmel</snm>
                  <fnm>M</fnm>
               </au>
               <au>
  