<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>1471-2164-12-73</ui><ji>1471-2164</ji><fm>
<dochead>Research article</dochead>
<bibl>
<title>
<p>Distinctive mitochondrial genome of Calanoid copepod <it>Calanus sinicus </it>with multiple large non-coding regions and reshuffled gene order: Useful molecular markers for phylogenetic and population studies</p>
</title>
<aug>
<au id="A1" ce="yes"><snm>Minxiao</snm><fnm>Wang</fnm><insr iid="I1"/><insr iid="I2"/><email>wangminxiao@qdio.ac.cn</email></au>
<au ca="yes" id="A2"><snm>Song</snm><fnm>Sun</fnm><insr iid="I1"/><email>sunsong@qdio.ac.cn</email></au>
<au id="A3" ce="yes"><snm>Chaolun</snm><fnm>Li</fnm><insr iid="I1"/><email>lcl@qdio.ac.cn</email></au>
<au id="A4"><snm>Xin</snm><fnm>Shen</fnm><insr iid="I3"/><email>shenthin@163.com</email></au>
</aug>
<insg>
<ins id="I1"><p>KLMEES and JBMERS, Institute of Oceanology, Chinese Academy of Sciences, 7 Nanhai Road, Qingdao 266071, China</p></ins>
<ins id="I2"><p>Graduate University, Chinese Academy of Sciences, 19 Yuquan Road, Beijing 100039, China</p></ins>
<ins id="I3"><p>Huaihai Institute of Technology, 59 Cangwu Road, Lianyungang 222005, China</p></ins>
</insg>
<source>BMC Genomics</source>
<issn>1471-2164</issn>
<pubdate>2011</pubdate>
<volume>12</volume>
<issue>1</issue>
<fpage>73</fpage>
<url>http://www.biomedcentral.com/1471-2164/12/73</url>
<xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-12-73</pubid><pubid idtype="pmpid">21269523</pubid></pubidlist></xrefbib>
</bibl>
<history><rec><date><day>2</day><month>7</month><year>2010</year></date></rec><acc><date><day>27</day><month>1</month><year>2011</year></date></acc><pub><date><day>27</day><month>1</month><year>2011</year></date></pub></history>
<cpyrt><year>2011</year><collab>Minxiao et al; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<sec>
<st>
<p>Background</p>
</st>
<p>Copepods are highly diverse and abundant and have undergone extensive ecological radiation in marine ecosystems. <it>Calanus sinicus </it>dominates continental shelf waters in the northwest Pacific Ocean and plays an important role in the local ecosystem by linking primary production to higher trophic levels. A lack of effective molecular markers has hindered phylogenetic and population genetic studies of copepods. Because they are informative at the genome level, mitochondrial DNA sequences can serve as markers for both population genetic and phylogenetic studies.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>The mitochondrial genome of <it>C. sinicus </it>is distinct from those of other arthropods owing to the concurrence of multiple non-coding regions and a reshuffled gene arrangement. Further particularities in the mitogenome of <it>C. sinicus </it>include low A + T-content, symmetrical nucleotide composition between strands, abbreviated stop codons for several protein-coding genes (PCGs) and extended lengths of the genes <it>atp6 </it>and <it>atp8 </it>relative to other copepods. The monophyletic Copepoda should be placed within the Vericrustacea. The close affinity between Cyclopoida and Poecilostomatoida suggests reassigning the latter as subordinate to the former. Monophyly of Maxillopoda is rejected. Within the alignment of 11 <it>C. sinicus </it>mitogenomes, there are 397 variable sites harbouring three 'hotspot' variable sites and three microsatellite loci.</p>
</sec>
<sec>
<st>
<p>Conclusion</p>
</st>
<p>The occurrence of the <it>circular subgenomic fragment </it>during laboratory assays suggests that special caution should be taken when sequencing mitogenomes using long PCR. Such a phenomenon may provide additional evidence of mitochondrial DNA recombination, which appears to have been a prerequisite for shaping the present mitochondrial profile of <it>C. sinicus </it>during its evolution. The lack of synapomorphic gene arrangements among copepods casts doubt on the utility of gene order as a molecular marker for deep phylogenetic analysis. However, mitochondrial genomic sequences have been valuable markers for resolving phylogenetic issues concerning copepods. The variable site maps of <it>C. sinicus </it>mitogenomes provide a solid foundation for population genetic studies.</p>
</sec>
</sec>
</abs>
</fm><bdy>
<sec>
<st>
<p>Background</p>
</st>
<p>Copepods play an important role in the aquatic ecosystem and are highly diverse. They comprise a multitude of taxa including 200 families, 1,650 genera and 11,500 species <abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp>, although this estimate may represent only 15% of the actual number <abbrgrp>
<abbr bid="B2">2</abbr>
</abbrgrp>. Copepods have successfully colonized almost all aquatic regimes and have developed diverse life styles <abbrgrp>
<abbr bid="B3">3</abbr>
</abbrgrp>. Phylogenetic studies are therefore required to develop a complete biodiversity inventory of the group, which will make it possible to investigate how copepods have acquired such diversity over time.</p>
<p>Several incompatible classification schemes have been proposed for copepods on the basis of morphological characteristics <abbrgrp>
<abbr bid="B4">4</abbr>
</abbrgrp>. Since the incorporation of copepods as a monophyletic group in 1859, phylogenetic studies have focused on the natural relationships among the incorporated orders, Calanoida, Cyclopoida, Gelyelloida, Harpacticoida, Misophrioida, Monstrilloida, Mormonilloida, Platycopioida, Poecilostomatoida and Siphonostomatoida <abbrgrp>
<abbr bid="B3">3</abbr>
</abbrgrp>. Dussart (1984) classified Calanoida and Poecilostomatoida together in the lineage Cyclopinidae-Oithonidae-(Poecilostomatoida-Calanoida) <abbrgrp>
<abbr bid="B5">5</abbr>
</abbrgrp>, while other researchers have classified the Calanoida outside Podoplea, at a relatively basal position <abbrgrp>
<abbr bid="B3">3</abbr>
<abbr bid="B6">6</abbr>
</abbrgrp>. Kabata, Marcotte and Boxshall hypothesised that Poecilostomatoida is the sister group to Cyclopoida. However, other studies have placed Poecilostomatoida and Siphonostomatoida in close phylogenetic affinity <abbrgrp>
<abbr bid="B3">3</abbr>
<abbr bid="B6">6</abbr>
</abbrgrp>. Recently, Boxshall reassigned Poecilostomatoida as a suborder of Cyclopoida <abbrgrp>
<abbr bid="B7">7</abbr>
</abbrgrp>. The relationships among copepods and other subgroups of Pancrustacea have yet to be elucidated, with 11 alternative sister-group hypotheses proposed for the taxon <abbrgrp>
<abbr bid="B8">8</abbr>
</abbrgrp>. The currently ambiguous status of copepod phylogenetic research is due at least in part to the limited number of diagnostic morphological characteristics, the difficulty in assessing morphological homology and a poor fossil record.</p>
<p>In metazoans, the mitochondrial genome is usually a circular, double-stranded DNA molecule (mtDNA), which is typically about 16 kb long but can vary from 14 to 48 kb. The gene content is conserved, with 37 genes (13 protein-encoding genes, two ribosomal RNA genes and 22 transfer RNA (tRNA) genes) plus one or more non-coding region(s) containing signals for transcription and replication of the mtDNA <abbrgrp>
<abbr bid="B9">9</abbr>
</abbrgrp>. Several advantages, including accelerated substitution rates, (almost) unambiguous orthology and genome-level informativeness <abbrgrp>
<abbr bid="B10">10</abbr>
<abbr bid="B11">11</abbr>
</abbrgrp> have allowed the mitochondrial genome to be widely used for population studies <abbrgrp>
<abbr bid="B12">12</abbr>
<abbr bid="B13">13</abbr>
</abbrgrp>, phylogeography <abbrgrp>
<abbr bid="B12">12</abbr>
<abbr bid="B14">14</abbr>
</abbrgrp> and the inference of phylogenetic relationships at various taxonomic levels across animal taxa, particularly in arthropods <abbrgrp>
<abbr bid="B15">15</abbr>
<abbr bid="B16">16</abbr>
<abbr bid="B17">17</abbr>
</abbrgrp>. Furthermore, extensive intraspecific polymorphism in the non-coding regions facilitates studies at the population level <abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp>. However, there is little information concerning the structure and genetic polymorphism of the non-coding regions in crustaceans.</p>
<p>Despite the vast diversity of copepods, few mitochondrial genomes have been sequenced. Taxon sampling has been biased towards certain orders, including Harpacticoida: <it>Tigriopus japonicus </it>
<abbrgrp>
<abbr bid="B18">18</abbr>
<abbr bid="B19">19</abbr>
</abbrgrp>, <it>Tigriopus californicus </it>
<abbrgrp>
<abbr bid="B14">14</abbr>
</abbrgrp>; Siphonostomatoida: <it>Lepeophtheirus salmonis </it>
<abbrgrp>
<abbr bid="B20">20</abbr>
</abbrgrp> and Cyclopoida: <it>Paracyclopina nana </it>
<abbrgrp>
<abbr bid="B21">21</abbr>
</abbrgrp>. More mitochondrial genomes with increased taxon coverage are required to resolve several issues concerning copepod phylogeny, including the position of the group within Pancrustacea and the relationships among its component orders. <it>Calanus sinicus </it>(Copepoda: Calanoida) dominates continental shelf waters in the northwest Pacific Ocean, linking primary production to the larvae and juveniles of fishes <abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>. Given its ecological importance, <it>C. sinicus </it>is one of the target species in the China-GLOBEC program. Despite this status, there is little information concerning the population genetics of this species owing to the lack of suitable genetic markers. This study presents a near-complete mitochondrial genome of <it>C. sinicus</it>, the first from a member of the Calanoida. The gene order of <it>C. sinicus </it>was compared with that of other copepods to trace the evolution of mitochondrial genomes in this group. Combining the new mitogenome with previously published mitogenomes from arthropods, a preliminary phylogenetic analysis was carried out to investigate the relationships between several orders in Copepoda and their positions within Pancrustacea. In addition, intraspecific polymorphisms of major loci in 11 <it>C. sinicus </it>mitogenomes from four populations were compared to screen potential markers for population studies.</p>
</sec>
<sec>
<st>
<p>Results and Discussion</p>
</st>
<sec>
<st>
<p>Genome Organization</p>
</st>
<p>A long-PCR-based genome sequencing protocol for animal mtDNAs was adopted. However, this technique failed to amplify a fragment containing partial non-coding regions and two tRNA genes. Several factors, including gene rearrangement, notable base composition bias, extended GC-rich tracts, highly repeated regions and stable secondary structures, could stall the polymerase and therefore complicate the recovery of mitogenomes from copepods using this technique <abbrgrp>
<abbr bid="B19">19</abbr>
<abbr bid="B21">21</abbr>
<abbr bid="B23">23</abbr>
</abbrgrp>.</p>
<p>The 20,460 bp assembled contig (Figure <figr fid="F1">1</figr>, Table <tblr tid="T1">1</tblr>) comprised all but two tRNA genes (<it>trnR </it>and <it>trnC</it>), and included 13 protein coding genes (<it>cox1</it>-<it>3</it>, <it>nad1</it>-<it>6</it>, <it>nad4L</it>, <it>atp6</it>, <it>atp8 </it>and <it>cytb</it>), two rRNA genes (<it>rrnS </it>and <it>rrnL</it>) and 20 tRNA genes. In addition, one of the long non-coding regions (LNR) between <it>trnH </it>and <it>trnA </it>was proposed as a control region (CR) on the basis of the secondary structure motifs identified. The majority of metazoan mitogenomes contain two abutting gene blocks, <it>nad4</it>/<it>nad4L </it>and <it>rrnS</it>/<it>rrnL</it>. However, these are separated in copepods. The 35 genes were located in three clusters interleaved by long non-coding regions (LNR1, LNR3 and LNR5). Unlike <it>Tigriopus sp</it>. <abbrgrp>
<abbr bid="B14">14</abbr>
</abbrgrp>, mitochondrial genes were encoded on both strands in <it>C. sinicus</it>, and the minority (<it>rrnL</it>, <it>trnV</it>, <it>trnD</it>, <it>trnT</it>, <it>nad4L</it>, <it>nad5</it>, <it>trnH</it>, <it>trnA</it>, <it>trnY</it>, <it>nad3</it>, <it>nad4</it>, <it>trnK</it>, <it>nad2</it>, <it>atp8 </it>and <it>atp6</it>) were identified on the H-strand (as defined by molecular weight). Of the 20 tRNA genes, 17 were arranged in three main clusters, V-D-T-S<sub>2</sub>, F-I and A-Y-E-Q-L<sub>1</sub>-P-M-K-W-S<sub>1</sub>-N, reading clockwise. Compactness is a characteristic feature of mitochondrial genomes <abbrgrp>
<abbr bid="B10">10</abbr>
</abbrgrp> and there were small gene overlaps at three gene borders. The largest overlap was identified between <it>trnY </it>and <it>trnE</it>, with a length of five nucleotides.</p>
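<p>The intergenic values and the overlap reported in Table <tblr tid="T1">1</tblr> follow directly from the feature coordinates. The sketch below is illustrative only: it assumes 1-based, inclusive positions as listed in Table <tblr tid="T1">1</tblr>, and the helper names (feature_length, intergenic_nt, spacers) are hypothetical and not part of the original analysis. A negative spacer denotes an overlap.</p>

```python
# Coordinates (start, end) copied from Table 1; assumed 1-based and inclusive.
features = [
    ("trnA", 12788, 12851),
    ("trnY", 12852, 12912),
    ("trnE", 12908, 12971),
    ("trnQ", 13002, 13067),
]


def feature_length(start, end):
    """Length of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def intergenic_nt(prev_end, next_start):
    """Nucleotides between two adjacent features; negative means overlap."""
    return next_start - prev_end - 1


def spacers(rows):
    """Return (upstream, downstream, spacer) for each adjacent pair of rows."""
    return [
        (a[0], b[0], intergenic_nt(a[2], b[1]))
        for a, b in zip(rows, rows[1:])
    ]


for up, down, gap in spacers(features):
    print(up, down, gap)
```

<p>For the four features listed, the spacers are 0, -5 and 30 nt, consistent with the five-nucleotide overlap between <it>trnY </it>and <it>trnE</it>.</p>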
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Mitochondrial genome organization of Calanoida copepod <it>Calanus sinicus</it></p></caption><text>
   <p><b>Mitochondrial genome organization of Calanoida copepod <it>Calanus sinicus</it></b>. The direction of gene transcription is indicated by the arrows. Protein-coding genes are shown as blue arrows, rRNA genes as purple arrows, tRNA genes as red arrows and large non-coding regions (>100 bp) as cyan rectangles. tRNA genes are labelled by single-letter IUPAC-IUB abbreviations (L1: CUN; L2: UUR; S1: AGN; S2: UCN) while other genes are represented as outlined in the abbreviations section. Ticks in the inner circle indicate the sequence length.</p>
</text><graphic file="1471-2164-12-73-1" hint_layout="double"/></fig>
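<p>Table <tblr tid="T1">1</tblr> reports AT %, GC-skew and AT-skew for each feature. As a rough sketch, assuming the conventional definitions GC-skew = (G - C)/(G + C) and AT-skew = (A - T)/(A + T), which are not spelled out in this section, such values could be computed as follows. The function name composition and the toy sequence are hypothetical and are not taken from the <it>C. sinicus </it>mitogenome.</p>

```python
from collections import Counter


def composition(seq):
    """Return (AT percent, GC-skew, AT-skew) for a DNA string.

    Assumed definitions: GC-skew = (G - C) / (G + C) and
    AT-skew = (A - T) / (A + T). Ambiguous bases are ignored.
    """
    counts = Counter(seq.upper())
    a, c, g, t = (counts[base] for base in "ACGT")
    total = a + c + g + t
    at_percent = 100.0 * (a + t) / total if total else 0.0
    gc_skew = (g - c) / (g + c) if (g + c) else 0.0
    at_skew = (a - t) / (a + t) if (a + t) else 0.0
    return at_percent, gc_skew, at_skew


# Toy sequence for illustration only.
print(composition("AATTGGGC"))
```

<p>For the toy sequence, the function returns an AT content of 50.0%, a GC-skew of 0.5 and an AT-skew of 0.0.</p>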
<tbl id="T1"><title><p>Table 1</p></title><caption><p>Mitochondrial genome profile and nucleotide composition of <it>C. sinicus</it>.</p></caption><tblbdy cols="10">
      <r>
         <c ca="center">
            <p>
               <b>Feature</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Strand</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Position</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Length</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Start</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Stop</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>AT %</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>GC-skew</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>AT-skew</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Intergenic nt</b>
               <sup>
                  <b>2</b>
               </sup>
            </p>
         </c>
      </r>
      <r>
         <c cspan="10">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>rrnL</it>
               </b>
               <sup>
                  <it>3</it>
               </sup>
            </p>
         </c>
         <c ca="center">
            <p>H</p>
         </c>
         <c ca="center">
            <p>2244 - 3382</p>
         </c>
         <c ca="center">
            <p>1139</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>71.7</p>
         </c>
         <c ca="center">
            <p>0.0567</p>
         </c>
         <c ca="center">
            <p>0.0126</p>
         </c>
         <c ca="center">
            <p>>2959(LNR1)</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnV</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>H</p>
         </c>
         <c ca="center">
            <p>3383 - 3447</p>
         </c>
         <c ca="center">
            <p>65</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>83.1</p>
         </c>
         <c ca="center">
            <p>-0.0888</p>
         </c>
         <c ca="center">
            <p>0.0734</p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnD</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>H</p>
         </c>
         <c ca="center">
            <p>3457 - 3518</p>
         </c>
         <c ca="center">
            <p>62</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>80.6</p>
         </c>
         <c ca="center">
            <p>0.165</p>
         </c>
         <c ca="center">
            <p>0.000</p>
         </c>
         <c ca="center">
            <p>9</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnT</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>H</p>
         </c>
         <c ca="center">
            <p>3531 - 3593</p>
         </c>
         <c ca="center">
            <p>63</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>74.6</p>
         </c>
         <c ca="center">
            <p>-0.126</p>
         </c>
         <c ca="center">
            <p>-0.0643</p>
         </c>
         <c ca="center">
            <p>12</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnS2</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>3594 - 3650</p>
         </c>
         <c ca="center">
            <p>57</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>73.7</p>
         </c>
         <c ca="center">
            <p>0.202</p>
         </c>
         <c ca="center">
            <p>0.0475</p>
         </c>
         <c ca="center">
            <p>0</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>cox1</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>3660 - 5207</p>
         </c>
         <c ca="center">
            <p>1548</p>
         </c>
         <c ca="center">
            <p>ATA</p>
         </c>
         <c ca="center">
            <p>TAA</p>
         </c>
         <c ca="center">
            <p>59.0</p>
         </c>
         <c ca="center">
            <p>0.0439</p>
         </c>
         <c ca="center">
            <p>-0.190</p>
         </c>
         <c ca="center">
            <p>9</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>nad4L</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>H</p>
         </c>
         <c ca="center">
            <p>5336 - 5671</p>
         </c>
         <c ca="center">
            <p>336</p>
         </c>
         <c ca="center">
            <p>ATA</p>
         </c>
         <c ca="center">
            <p>TAG</p>
         </c>
         <c ca="center">
            <p>61.9</p>
         </c>
         <c ca="center">
            <p>0.0919</p>
         </c>
         <c ca="center">
            <p>-0.202</p>
         </c>
         <c ca="center">
            <p>128(LNR2)</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>cytb</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>5751 - 6887</p>
         </c>
         <c ca="center">
            <p>1137</p>
         </c>
         <c ca="center">
            <p>ATG</p>
         </c>
         <c ca="center">
            <p>TAA</p>
         </c>
         <c ca="center">
            <p>60.2</p>
         </c>
         <c ca="center">
            <p>-0.0050</p>
         </c>
         <c ca="center">
            <p>-0.249</p>
         </c>
         <c ca="center">
            <p>79</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>nad6</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>6898 - 7377</p>
         </c>
         <c ca="center">
            <p>480</p>
         </c>
         <c ca="center">
            <p>ATT</p>
         </c>
         <c ca="center">
            <p>TAG</p>
         </c>
         <c ca="center">
            <p>62.3</p>
         </c>
         <c ca="center">
            <p>0.0822</p>
         </c>
         <c ca="center">
            <p>-0.149</p>
         </c>
         <c ca="center">
            <p>10</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>rrnS</it>
               </b>
               <sup>
                  <it>3</it>
               </sup>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>7429 - 8082</p>
         </c>
         <c ca="center">
            <p>654</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>73.1</p>
         </c>
         <c ca="center">
            <p>0.264</p>
         </c>
         <c ca="center">
            <p>0.0369</p>
         </c>
         <c ca="center">
            <p>51</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnG</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>8083 - 8146</p>
         </c>
         <c ca="center">
            <p>64</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>87.6</p>
         </c>
         <c ca="center">
            <p>0.000</p>
         </c>
         <c ca="center">
            <p>0.000</p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>nad1</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>8147 - 9063</p>
         </c>
         <c ca="center">
            <p>917</p>
         </c>
         <c ca="center">
            <p>ATA</p>
         </c>
         <c ca="center">
            <p>TA<sup>1</sup></p>
         </c>
         <c ca="center">
            <p>62.5</p>
         </c>
         <c ca="center">
            <p>0.000</p>
         </c>
         <c ca="center">
            <p>-0.174</p>
         </c>
         <c ca="center">
            <p>0</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnF</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>9064 - 9126</p>
         </c>
         <c ca="center">
            <p>63</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>66.6</p>
         </c>
         <c ca="center">
            <p>0.237</p>
         </c>
         <c ca="center">
            <p>0.000</p>
         </c>
         <c ca="center">
            <p>0</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnI</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>9131 - 9193</p>
         </c>
         <c ca="center">
            <p>63</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>58.7</p>
         </c>
         <c ca="center">
            <p>0.153</p>
         </c>
         <c ca="center">
            <p>0.0801</p>
         </c>
         <c ca="center">
            <p>4</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>nad5</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>H</p>
         </c>
         <c ca="center">
            <p>9231 - 10954</p>
         </c>
         <c ca="center">
            <p>1724</p>
         </c>
         <c ca="center">
            <p>ATT</p>
         </c>
         <c ca="center">
            <p>TA<sup>1</sup></p>
         </c>
         <c ca="center">
            <p>59.5</p>
         </c>
         <c ca="center">
            <p>0.0272</p>
         </c>
         <c ca="center">
            <p>-0.109</p>
         </c>
         <c ca="center">
            <p>37</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnH</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>H</p>
         </c>
         <c ca="center">
            <p>10955 - 11017</p>
         </c>
         <c ca="center">
            <p>63</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>69.8</p>
         </c>
         <c ca="center">
            <p>0.369</p>
         </c>
         <c ca="center">
            <p>0.0917</p>
         </c>
         <c ca="center">
            <p>0</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnA</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>H</p>
         </c>
         <c ca="center">
            <p>12788 - 12851</p>
         </c>
         <c ca="center">
            <p>64</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>67.2</p>
         </c>
         <c ca="center">
            <p>0.0488</p>
         </c>
         <c ca="center">
            <p>-0.164</p>
         </c>
         <c ca="center">
            <p>1770(LNR3)</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnY</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>H</p>
         </c>
         <c ca="center">
            <p>12852 - 12912</p>
         </c>
         <c ca="center">
            <p>61</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>60.6</p>
         </c>
         <c ca="center">
            <p>0.000</p>
         </c>
         <c ca="center">
            <p>-0.0264</p>
         </c>
         <c ca="center">
            <p>0</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnE</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>12908 - 12971</p>
         </c>
         <c ca="center">
            <p>64</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>68.7</p>
         </c>
         <c ca="center">
            <p>0.000</p>
         </c>
         <c ca="center">
            <p>0.0451</p>
         </c>
         <c ca="center">
            <p>-5</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnQ</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>13002 - 13067</p>
         </c>
         <c ca="center">
            <p>66</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>77.5</p>
         </c>
         <c ca="center">
            <p>0.604</p>
         </c>
         <c ca="center">
            <p>-0.102</p>
         </c>
         <c ca="center">
            <p>30</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnL1</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>13080 - 13143</p>
         </c>
         <c ca="center">
            <p>64</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>80.1</p>
         </c>
         <c ca="center">
            <p>0.431</p>
         </c>
         <c ca="center">
            <p>0.104</p>
         </c>
         <c ca="center">
            <p>12</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnP</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>13178 - 13240</p>
         </c>
         <c ca="center">
            <p>63</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>76.2</p>
         </c>
         <c ca="center">
            <p>0.202</p>
         </c>
         <c ca="center">
            <p>-0.0420</p>
         </c>
         <c ca="center">
            <p>34</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnM</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>13246 - 13309</p>
         </c>
         <c ca="center">
            <p>64</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>68.7</p>
         </c>
         <c ca="center">
            <p>-0.0990</p>
         </c>
         <c ca="center">
            <p>0.0917</p>
         </c>
         <c ca="center">
            <p>5</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnK</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>13312 - 13374</p>
         </c>
         <c ca="center">
            <p>63</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>68.2</p>
         </c>
         <c ca="center">
            <p>-0.101</p>
         </c>
         <c ca="center">
            <p>0.0235</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnW</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>13376 - 13439</p>
         </c>
         <c ca="center">
            <p>64</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>78.1</p>
         </c>
         <c ca="center">
            <p>-0.142</p>
         </c>
         <c ca="center">
            <p>0.0807</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnS1</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>13439 - 13498</p>
         </c>
         <c ca="center">
            <p>60</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>77.0</p>
         </c>
         <c ca="center">
            <p>-0.144</p>
         </c>
         <c ca="center">
            <p>-0.0208</p>
         </c>
         <c ca="center">
            <p>-1</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnN</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>13498 - 13565</p>
         </c>
         <c ca="center">
            <p>68</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>61.8</p>
         </c>
         <c ca="center">
            <p>-0.154</p>
         </c>
         <c ca="center">
            <p>0.0485</p>
         </c>
         <c ca="center">
            <p>-1</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>cox2</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>13571 - 14275</p>
         </c>
         <c ca="center">
            <p>705</p>
         </c>
         <c ca="center">
            <p>ATT</p>
         </c>
         <c ca="center">
            <p>TAA</p>
         </c>
         <c ca="center">
            <p>62.6</p>
         </c>
         <c ca="center">
            <p>0.0749</p>
         </c>
         <c ca="center">
            <p>-0.166</p>
         </c>
         <c ca="center">
            <p>5</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>nad3</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>H</p>
         </c>
         <c ca="center">
            <p>14338 - 14691</p>
         </c>
         <c ca="center">
            <p>354</p>
         </c>
         <c ca="center">
            <p>ATT</p>
         </c>
         <c ca="center">
            <p>TAA</p>
         </c>
         <c ca="center">
            <p>61.3</p>
         </c>
         <c ca="center">
            <p>0.183</p>
         </c>
         <c ca="center">
            <p>-0.318</p>
         </c>
         <c ca="center">
            <p>62</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>cox3</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>L</p>
         </c>
         <c ca="center">
            <p>14794 - 15585</p>
         </c>
         <c ca="center">
            <p>792</p>
         </c>
         <c ca="center">
            <p>ATG</p>
         </c>
         <c ca="center">
            <p>TAA</p>
         </c>
         <c ca="center">
            <p>57.1</p>
         </c>
         <c ca="center">
            <p>0.0536</p>
         </c>
         <c ca="center">
            <p>-0.208</p>
         </c>
         <c ca="center">
            <p>102(LNR4)</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>nad4</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>H</p>
         </c>
         <c ca="center">
            <p>16357 - 17658</p>
         </c>
         <c ca="center">
            <p>1302</p>
         </c>
         <c ca="center">
            <p>ATA</p>
         </c>
         <c ca="center">
            <p>TAA</p>
         </c>
         <c ca="center">
            <p>59.9</p>
         </c>
         <c ca="center">
            <p>-0.0697</p>
         </c>
         <c ca="center">
            <p>-0.142</p>
         </c>
         <c ca="center">
            <p>771(LNR5)</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>trnL2</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>H</p>
         </c>
         <c ca="center">
            <p>17663 - 17728</p>
         </c>
         <c ca="center">
            <p>66</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>69.7</p>
         </c>
         <c ca="center">
            <p>0.102</p>
         </c>
         <c ca="center">
            <p>0.174</p>
         </c>
         <c ca="center">
            <p>4</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>nad2</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>H</p>
         </c>
         <c ca="center">
            <p>17729 - 18697</p>
         </c>
         <c ca="center">
            <p>969</p>
         </c>
         <c ca="center">
            <p>ATA</p>
         </c>
         <c ca="center">
            <p>TAA</p>
         </c>
         <c ca="center">
            <p>61.9</p>
         </c>
         <c ca="center">
            <p>-0.0761</p>
         </c>
         <c ca="center">
            <p>-0.202</p>
         </c>
         <c ca="center">
            <p>0</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>atp8</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>H</p>
         </c>
         <c ca="center">
            <p>18870 - 19031</p>
         </c>
         <c ca="center">
            <p>162</p>
         </c>
         <c ca="center">
            <p>ATT</p>
         </c>
         <c ca="center">
            <p>TAA</p>
         </c>
         <c ca="center">
            <p>70.4</p>
         </c>
         <c ca="center">
            <p>-0.0811</p>
         </c>
         <c ca="center">
            <p>-0.122</p>
         </c>
         <c ca="center">
            <p>172(LNR6)</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>atp6</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>H</p>
         </c>
         <c ca="center">
            <p>19034 - 19744</p>
         </c>
         <c ca="center">
            <p>711</p>
         </c>
         <c ca="center">
            <p>ATG</p>
         </c>
         <c ca="center">
            <p>TAG</p>
         </c>
         <c ca="center">
            <p>59.5</p>
         </c>
         <c ca="center">
            <p>-0.0963</p>
         </c>
         <c ca="center">
            <p>-0.126</p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>Genes were labelled as outlined in the abbreviations section. AT skew = (A% - T%)/(A% + T%); GC skew = (G% - C%)/(C% + G%).</p>
      <p><sup>1 </sup>truncated stop codon, which is possibly completed via post-transcriptional adenylation;</p>
      <p><sup>2 </sup>unassigned nucleotides (positive values) or overlapped nucleotides (negative values) between two adjacent genes with large non-coding regions outlined;</p>
      <p><sup>3 </sup>initiation or termination positions of ribosomal RNAs defined by adjacent gene boundaries.</p>
   </tblfn></tbl>
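<p>The intergenic values in Table 1 can be recomputed from the listed coordinates. The following is a minimal sketch, assuming that coordinates are inclusive and that each value refers to the spacer preceding the gene in that row, which is consistent with the rows shown. The function names are illustrative and are not taken from the annotation pipeline used in this study.</p>

```python
# Coordinates (start, end) copied from Table 1 for a run of adjacent genes.
genes = [
    ("trnK", 13312, 13374),
    ("trnW", 13376, 13439),
    ("trnS1", 13439, 13498),
    ("trnN", 13498, 13565),
    ("cox2", 13571, 14275),
    ("nad3", 14338, 14691),
]


def gene_length(start, end):
    # Both coordinates are inclusive, hence the +1.
    return end - start + 1


def spacer_before(prev_end, next_start):
    # Positive: unassigned nucleotides; negative: overlapping nucleotides.
    return next_start - prev_end - 1


# Spacer preceding each gene, keyed by the name of the downstream gene.
spacers = {}
for (_, _, prev_end), (name, start, _) in zip(genes, genes[1:]):
    spacers[name] = spacer_before(prev_end, start)

print(spacers)
print(gene_length(13571, 14275))
```

<p>With these inputs, the single-nucleotide overlaps flanking <it>trnS1 </it>appear as negative values, matching the convention of footnote 2.</p>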
</sec>
<sec>
<st>
<p>Base Composition and Codon Usage</p>
</st>
<p>The H-strand in the <it>C. sinicus </it>mitogenome comprises 32.1% A, 19.1% C, 19.3% G and 29.6% T. As presented in Table <tblr tid="T1">1</tblr>, the overall A + T content of <it>C. sinicus </it>is relatively low (61.7%) in comparison with other crustaceans, but falls within the range for copepods, from a minimum of 60.4% in <it>T. japonicus </it>to a maximum of 70.8% in <it>P. nana </it>(Additional file <supplr sid="S1">1</supplr>). The same trend was observed in the protein coding genes (PCGs, 60.3%) and non-coding sequences (58.2%), whose A + T contents were lower than those in the majority of crustaceans. The A + T content of structural RNA genes was considerably higher, at 72.3% and 72.2% for tRNA and rRNA genes, respectively, which is comparable with other crustaceans.</p>
<suppl id="S1">
<title>
<p>Additional file 1</p>
</title>
<text>
<p>
<b>Comparison of the length, A + T-content and nucleotide compositional bias of mitochondrial genomes among copepods</b>. Values were obtained from the corresponding GenBank files. Detailed values are presented for the complete genomes, overall PCGs, individual codon positions of PCGs and structural rRNA genes.
<sup>NA </sup>missing data due to incomplete sequencing of the mitogenome.</p>
</text>
<file name="1471-2164-12-73-S1.XLS">
   <p>Click here for file</p>
</file>
</suppl>
<p>Metazoan mitogenomes normally bear a clear strand asymmetry in terms of nucleotide composition owing to asymmetric deamination of A and C nucleotides on each strand during replication and/or transcription processes <abbrgrp>
<abbr bid="B24">24</abbr>
</abbrgrp>. However, the two nucleotides of each complementary pair occur in approximately equal numbers in <it>C. sinicus</it>. Measured as AT-skew ((A% - T%)/(A% + T%)) and GC-skew ((G% - C%)/(G% + C%)), the H-strand composition reported above gives an AT-skew that is only moderately positive (0.0405) and a GC-skew that is close to zero (0.00521). Similar results have been reported in other copepods (Additional file <supplr sid="S1">1</supplr>; <it>P. nana</it>: AT-skew = -0.0457, GC-skew = 0.0598; <it>S. polycolpus</it>: AT-skew = -0.0389, GC-skew = 0.0102). In contrast to the whole genome, an anti-A skew was apparent for all PCGs (-0.177; ranging from -0.318 for <it>nad3 </it>to -0.109 for <it>nad5</it>) regardless of the strand on which they were encoded, while adenines were slightly preferred in both rRNA genes (0.0125 for <it>rrnL </it>and 0.0369 for <it>rrnS</it>). As demonstrated in Table <tblr tid="T1">1</tblr> and Additional file <supplr sid="S1">1</supplr>, guanines are over-represented in rRNA and tRNA genes. PCGs show either neutral (<it>cytb </it>and <it>nad1</it>), negative (<it>nad4</it>, <it>nad2</it>, <it>atp8 </it>and <it>atp6</it>) or positive GC-skew. Interestingly, all negatively GC-skewed genes cluster in a gene block with the same transcriptional polarity, possibly because of an inversion or a change in the transcriptional polarity of the gene block.</p>
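<p>The skew formulas used in this section can be sketched in a few lines. The example below is illustrative only: the function name and the toy sequence are assumptions and are not data from this study.</p>

```python
from collections import Counter


def skews(seq):
    # Count each base on the strand supplied; other symbols are ignored.
    counts = Counter(seq.upper())
    a, c, g, t = (counts[base] for base in "ACGT")
    # AT skew = (A - T)/(A + T); GC skew = (G - C)/(G + C).
    # A sequence lacking A/T or G/C would raise ZeroDivisionError.
    at_skew = (a - t) / (a + t)
    gc_skew = (g - c) / (g + c)
    return at_skew, gc_skew


toy = "AAAATTTGGGCC"  # hypothetical sequence: 4 A, 3 T, 3 G, 2 C
at_skew, gc_skew = skews(toy)
print(round(at_skew, 4), round(gc_skew, 4))
```

<p>Because the counts are taken from one strand only, the sign of each skew reverses when the complementary strand is analysed, which is why strand assignment matters for the gene-by-gene comparison.</p>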
<p>To elucidate possible mechanisms that have shaped the present-day strand asymmetry in nucleotide composition in the lineage, the GC-skews of individual PCGs of copepods were compared with those of <it>Limulus polyphemus </it>(Figure <figr fid="F2">2</figr>). The strand asymmetry profiles of copepods differed significantly from those of <it>L. polyphemus </it>in most PCGs. This suggests that a global reversal of the skew is a possible synapomorphy of the group, probably due to an ancestral inversion of the control region. However, specific nucleotide asymmetry profiles can be identified in all genes with the exception of <it>cox2 </it>and <it>nad3</it>, possibly because of shifts in the transcriptional polarity of the genes. Third codon positions of the PCGs are less constrained and tend to accumulate nucleotide skew more quickly, making them more likely to be at equilibrium. Opposite skews at different codon positions in several genes could therefore be evidence of recent inversions. Taken together, these observations suggest that a complex series of rearrangement events may have occurred in this lineage.</p>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>Strand compositional asymmetry in Copepoda</p></caption><text>
   <p><b>Strand compositional asymmetry in Copepoda</b>. GC skewness was calculated for 1<sup>st </sup>plus 2<sup>nd </sup>(on the left) and 3<sup>rd </sup>(on the right) codon positions for the 13 protein-coding genes of Copepoda. For each plot, the values for <it>Limulus polyphemus </it>are given. Genes were abbreviated as outlined in the abbreviations section.</p>
</text><graphic file="1471-2164-12-73-2" hint_layout="double"/></fig>
<p>The pattern of codon usage in the <it>C. sinicus </it>mitogenome was studied (Additional file <supplr sid="S2">2</supplr>). A preference for AT-rich codons was identified in <it>C. sinicus</it>, as is the case in mitochondrial PCGs of other arthropods. For example, the most frequently used codons are UUU(F) (63 codons per 1000 codons), followed by AUU(I) (54 codons per 1000 codons) and then AUA(M) (49 codons per 1000 codons). Among copepods, the A + T content of the overall mitogenome is highly correlated with the corresponding values in degenerate synonymous sites of protein coding genes (R<sup>2 </sup>= 0.9918). The A + T-content of the 3<sup>rd </sup>codon positions (62.6%) in <it>C. sinicus</it>, which is only slightly higher than that in <it>T. japonicus</it>, is lower than that in most other crustaceans.</p>
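<p>Codon frequencies expressed per 1000 codons, as quoted above, can be tallied as follows. This is a minimal sketch under stated assumptions: the function name and the toy coding sequence are hypothetical, the reading frame is assumed to start at the first nucleotide, and any trailing partial codon is discarded.</p>

```python
from collections import Counter


def codons_per_thousand(cds):
    # Work in RNA notation so that codons read as in the text (e.g. AUU).
    rna = cds.upper().replace("T", "U")
    # Drop a trailing partial codon, such as a truncated stop.
    usable = len(rna) - len(rna) % 3
    codons = [rna[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(codons)
    total = len(codons)
    # An empty input would raise ZeroDivisionError.
    return {codon: 1000 * n / total for codon, n in counts.items()}


toy_cds = "ATTTTTATTATAGGT"  # hypothetical: AUU UUU AUU AUA GGU
usage = codons_per_thousand(toy_cds)
print(usage)
```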
<suppl id="S2">
<title>
<p>Additional file 2</p>
</title>
<text>
<p>
<b>Codon usage for the protein-coding genes in the mitogenome of <it>C. sinicus</it>
</b>. n indicates the total number of codons used in all 13 mitochondrial protein-coding genes.</p>
</text>
<file name="1471-2164-12-73-S2.XLS">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec>
<st>
<p>Protein coding genes</p>
</st>
<p>More than one reasonable start codon can be predicted for several genes. Start codons were therefore selected from the candidates so as to avoid large overlaps between genes and to keep gene lengths consistent with those of other crustaceans. A total of 11,137 nucleotides encode the 13 protein coding genes in <it>C. sinicus</it>, at least 147 nt more than in other copepod mitogenomes. The genes <it>atp6 </it>and <it>atp8 </it>are heavily truncated in other copepods but retain their regular size in <it>C. sinicus</it>, and account for most of this increase in PCG length. Each of the 13 protein-coding genes in <it>C. sinicus </it>starts with a typical ATD initiation codon: ATA for <it>cox1</it>, <it>nad1</it>, <it>nad2</it>, <it>nad4 </it>and <it>nad4L</it>, ATT for <it>atp8</it>, <it>cox2</it>, <it>nad3 </it>and <it>nad5</it>-<it>6</it>, and ATG for the remainder. Previous studies have reported several atypical initiation codons for <it>cox1 </it>in arthropods <abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp>. However, the copepods studied to date possess a regular start codon (ATA) for <it>cox1</it>.</p>
<p>The majority of the 13 protein coding genes terminate with the conventional stop codons TAG or TAA, but <it>nad1 </it>and <it>nad5 </it>have truncated stop codons (TA) adjacent to a downstream tRNA gene. The presence of incomplete stop codons is common in metazoan mitogenomes, and the shortened stop codons are likely to be completed via post-transcriptional polyadenylation <abbrgrp>
<abbr bid="B26">26</abbr>
</abbrgrp>.</p>
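<p>The start and stop codon conventions described above can be expressed as two small checks. This is an illustrative sketch only: the helper names are hypothetical, and padding a truncated stop with adenines is a simplified stand-in for post-transcriptional polyadenylation.</p>

```python
# ATD start codons, where D is the IUPAC code for A, G or T.
START_CODONS = {"ATA", "ATG", "ATT"}
# Conventional complete stop codons named in the text.
STOP_CODONS = {"TAA", "TAG"}


def has_atd_start(cds):
    # True when the first three nucleotides form an ATD codon.
    return cds[:3].upper() in START_CODONS


def completed_stop(tail):
    # Pad a truncated stop (T or TA) with A residues up to three
    # nucleotides; a complete codon is returned unchanged.
    tail = tail.upper()
    return tail + "A" * (3 - len(tail))


print(has_atd_start("ATTGCA"))
print(completed_stop("TA"))
```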
<p>Because substitutions are clearly saturated at the nucleotide level, the amino acid sequences of the PCGs were compared among copepods. As illustrated in Table <tblr tid="T2">2</tblr>, the overall amino acid divergence among the copepods was particularly high, ranging from 0.238 in <it>cox1 </it>to 0.768 in <it>nad4L</it>. In general, genes encoding proteins of complex I (<it>nad1</it>-<it>6</it>, <it>nad4L</it>) of the electron transport chain were less conserved than the others. Altered mutation rates and relatively relaxed selective constraints <abbrgrp>
<abbr bid="B27">27</abbr>
</abbrgrp> are two possible factors responsible for the elevated divergence of the mitochondrial genes for complex I. However, the NADH dehydrogenase genes are dispersed within the mitogenomes of copepods, so it is unlikely that several of them would simultaneously show mutation rates altered in the same direction. The latter interpretation therefore seems more plausible. Structural or functional constraints at the protein level can lead to locus-specific selective pressures acting on mitochondrial genomes, giving rise to a higher divergence in some PCGs.</p>
<tbl id="T2"><title><p>Table 2</p></title><caption><p>Genetic divergence of the mitochondrial genes among five copepods and 11 individuals of <it>C. sinicus</it>.</p></caption><tblbdy cols="5">
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>gene</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>
                  <it>DB</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>
                  <it>&#969;B</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>
                  <it>DW</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>
                  <it>&#969;W</it>
               </b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>atp6</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>0.557</p>
         </c>
         <c ca="center">
            <p>0.0736</p>
         </c>
         <c ca="center">
            <p>0.00102</p>
         </c>
         <c ca="center">
            <p>0.157</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>atp8</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>0.519</p>
         </c>
         <c ca="center">
            <p>NA<sup>1</sup></p>
         </c>
         <c ca="center">
            <p>0.00314</p>
         </c>
         <c ca="center">
            <p>0.000</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>cox1</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>0.238</p>
         </c>
         <c ca="center">
            <p>0.0283</p>
         </c>
         <c ca="center">
            <p>0.00115</p>
         </c>
         <c ca="center">
            <p>0.000</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>cox2</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>0.488</p>
         </c>
         <c ca="center">
            <p>0.0369</p>
         </c>
         <c ca="center">
            <p>0.00098</p>
         </c>
         <c ca="center">
            <p>0.885</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>cox3</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>0.372</p>
         </c>
         <c ca="center">
            <p>0.0074</p>
         </c>
         <c ca="center">
            <p>0.00211</p>
         </c>
         <c ca="center">
            <p>0.000</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>cytb</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>0.395</p>
         </c>
         <c ca="center">
            <p>0.0026</p>
         </c>
         <c ca="center">
            <p>0.00070</p>
         </c>
         <c ca="center">
            <p>0.000</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>nad1</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>0.528</p>
         </c>
         <c ca="center">
            <p>0.0926</p>
         </c>
         <c ca="center">
            <p>0.00115</p>
         </c>
         <c ca="center">
            <p>0.159</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>nad2</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>0.753</p>
         </c>
         <c ca="center">
            <p>0.0100</p>
         </c>
         <c ca="center">
            <p>0.00255</p>
         </c>
         <c ca="center">
            <p>0.222</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>nad3</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>0.581</p>
         </c>
         <c ca="center">
            <p>0.0923</p>
         </c>
         <c ca="center">
            <p>0.00103</p>
         </c>
         <c ca="center">
            <p>0.358</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>nad4</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>0.668</p>
         </c>
         <c ca="center">
            <p>0.0534</p>
         </c>
         <c ca="center">
            <p>0.00184</p>
         </c>
         <c ca="center">
            <p>0.914</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>nad4L</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>0.768</p>
         </c>
         <c ca="center">
            <p>0.0617</p>
         </c>
         <c ca="center">
            <p>0.000</p>
         </c>
         <c ca="center">
            <p>0.000</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>nad5</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>0.625</p>
         </c>
         <c ca="center">
            <p>0.0306</p>
         </c>
         <c ca="center">
            <p>0.00173</p>
         </c>
         <c ca="center">
            <p>0.279</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>nad6</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>0.732</p>
         </c>
         <c ca="center">
            <p>0.0087</p>
         </c>
         <c ca="center">
            <p>0.00333</p>
         </c>
         <c ca="center">
            <p>0.435</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>rrnS</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>0.405</p>
         </c>
         <c ca="center">
            <p>NA<sup>2</sup></p>
         </c>
         <c ca="center">
            <p>0.00078</p>
         </c>
         <c ca="center">
            <p>NA<sup>2</sup></p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>
                  <it>rrnL</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>0.409</p>
         </c>
         <c ca="center">
            <p>NA<sup>2</sup></p>
         </c>
         <c ca="center">
            <p>0.00163</p>
         </c>
         <c ca="center">
            <p>NA<sup>2</sup></p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>tRNA</b>
            </p>
         </c>
         <c ca="center">
            <p>NA<sup>1</sup></p>
         </c>
         <c ca="center">
            <p>NA<sup>2</sup></p>
         </c>
         <c ca="center">
            <p>0.00060</p>
         </c>
         <c ca="center">
            <p>NA<sup>2</sup></p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>overall</b>
            </p>
         </c>
         <c ca="center">
            <p>0.525</p>
         </c>
         <c ca="center">
            <p>0.0490</p>
         </c>
         <c ca="center">
            <p>0.00150</p>
         </c>
         <c ca="center">
            <p>0.203</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>Five copepods, the PCGs of which have been entirely determined, were selected for interspecific analysis. For the PCGs, amino acid divergence was used for the interspecific analyses and nucleotide divergence for the intraspecific comparisons. Ratios of non-synonymous to synonymous substitutions for the 13 PCGs were compared. Genes were labelled as outlined in the abbreviations section. DB and &#969;B: p-distance and dN/dS among five copepods; DW and &#969;W: p-distance and dN/dS within <it>C. sinicus</it>.</p>
      <p><sup>1 </sup>missing data due to the incomplete dataset;</p>
      <p><sup>2 </sup>unfeasible parameter for the corresponding genes.</p>
   </tblfn></tbl>
<p>To examine the evolutionary forces acting on the mitochondrial PCGs of copepods, the ratio of the rate of non-synonymous substitution (dN) to that of synonymous substitution (dS) was determined. The observed dN/dS ratios (Table <tblr tid="T2">2</tblr>) were consistently lower than one, ranging from 0.0026 for <it>cytb </it>to 0.0926 for <it>nad1</it>, which indicates strong purifying selection within this lineage. Values of dN/dS for the least divergent genes (<it>cox1</it>-<it>3</it>, <it>cytb</it>) were generally lower, in agreement with the idea that highly divergent genes are normally subjected to less selective pressure.</p>
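<p>The p-distances listed in Table 2 are proportions of differing sites between aligned sequences. The sketch below shows one way to compute such a value for a single pair. It assumes that the sequences are already aligned and that sites with a gap in either sequence are skipped. The gap treatment and the function name are assumptions, not a description of the software used in this study.</p>

```python
def p_distance(seq_a, seq_b, gap="-"):
    # Both sequences must come from the same alignment.
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    # Keep only the columns where neither sequence has a gap.
    pairs = [
        (x, y)
        for x, y in zip(seq_a.upper(), seq_b.upper())
        if x != gap and y != gap
    ]
    # Proportion of compared sites at which the two residues differ.
    differences = sum(1 for x, y in pairs if x != y)
    return differences / len(pairs)


# Hypothetical aligned amino acid fragments, two differences in six sites.
print(p_distance("MKTLLA", "MKSLIA"))
```

<p>An average over all pairs of species or individuals would then give a per-gene value comparable in form to DB or DW.</p>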
</sec>
<sec>
<st>
<p>Ribosomal RNA genes</p>
</st>
<p>In the mitogenome of <it>C. sinicus</it>, the 16S ribosomal RNA (<it>rrnL</it>) and 12S ribosomal RNA (<it>rrnS</it>) genes are located between <it>trnV</it>/LNR1 and <it>trnG</it>/<it>nad6</it>, respectively. In arthropods, the rRNA genes are normally adjacent on the same strand, interleaved by a single <it>trnV</it>. However, the two genes are distantly separated on either strand in <it>C. sinicus</it>, which is rare in metazoans. Examples of the arrangement are mainly found in the primary lineages <abbrgrp>
<abbr bid="B28">28</abbr>
</abbrgrp>. The sizes of the <it>rrnS </it>and <it>rrnL </it>genes in <it>C. sinicus </it>were calculated to be 654 bp and 1,139 bp, respectively, on the basis of alignment and comparison with their counterparts in <it>N. cristatus</it>. These lengths are similar to those of <it>P. nana</it>, but longer than the corresponding lengths in other copepods. Consistent with the PCGs, the two rRNA genes were found to be highly divergent, with values of 0.405 and 0.409 for <it>rrnS </it>and <it>rrnL</it>, respectively (Table <tblr tid="T2">2</tblr>). The secondary structure of <it>rrnS </it>(Figure <figr fid="F3">3</figr>) was proposed on the basis of the model of Gutell <abbrgrp>
<abbr bid="B29">29</abbr>
</abbrgrp>, and that of <it>rrnL </it>(Figure <figr fid="F4">4</figr>) on the basis of the model of De Rijk et al. <abbrgrp>
<abbr bid="B30">30</abbr>
</abbrgrp>. In accordance with their phylogenetic relationships, the secondary structures of <it>C. sinicus </it>rRNAs resembled those of a crustacean <abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp> (<it>Daphnia pulex</it>) more closely than those of an insect (<it>Drosophila yakuba</it>, secondary structures obtained from the European ribosomal RNA database <abbrgrp>
<abbr bid="B32">32</abbr>
</abbrgrp>).</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>The inferred secondary structure of the <it>rrnS </it>of <it>C. sinicus</it></p></caption><text>
   <p><b>The inferred secondary structure of the <it>rrnS </it>of <it>C. sinicus</it></b>. Inferred nucleotide bonds are illustrated by lines. The secondary structure was based on the model of Gutell (1994).</p>
</text><graphic file="1471-2164-12-73-3" hint_layout="single"/></fig>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>The inferred secondary structure of the <it>rrnL </it>of <it>C. sinicus</it></p></caption><text>
   <p><b>The inferred secondary structure of the <it>rrnL </it>of <it>C. sinicus</it></b>. Inferred nucleotide bonds are illustrated by lines. Helix numbering follows that of De Rijk et al. (1997).</p>
</text><graphic file="1471-2164-12-73-4" hint_layout="single"/></fig>
<p>Compared with the insect <it>D. yakuba</it>, several compound helixes degenerate into a single one in the crustacean <it>rrnS </it>secondary structures. Both crustaceans lack helixes 8, 12, 39 and 41 whereas the counterparts are present in <it>D. yakuba</it>. All helixes present in <it>D. pulex </it>are shared by <it>C. sinicus </it>with the exception of helix 1. However, most loops and linking sequences between helixes are somewhat reduced, leading to a much shorter <it>rrnS </it>in <it>C. sinicus</it>. The alignment of <it>rrnS </it>genes for copepods indicates that 5' sequences upstream of helix 32 are more variable.</p>
<p>In terms of <it>rrnL</it>, the sequences upstream of helix C1 were too ambiguous to align. High diversity in this region has been reported in several species <abbrgrp>
<abbr bid="B33">33</abbr>
<abbr bid="B34">34</abbr>
<abbr bid="B35">35</abbr>
</abbrgrp>, where some or all helixes are truncated <abbrgrp>
<abbr bid="B33">33</abbr>
<abbr bid="B35">35</abbr>
</abbrgrp>. Helix G13, present in <it>D. yakuba</it>, is absent in <it>C. sinicus </it>and <it>D. pulex</it>. In addition, the compound Helix D13/D14 is replaced by a single stem-loop, and Helix H3 is absent in <it>C. sinicus</it>. The greatest sequence conservation was at the 3' end from Helix G2 to H2, consistent with the idea that this region is the main component of the peptidyl transferase centre <abbrgrp>
<abbr bid="B36">36</abbr>
</abbrgrp>.</p>
</sec>
<sec>
<st>
<p>Transfer RNA genes</p>
</st>
<p>Although the mitogenome was only partially sequenced, 20 of the 22 mitochondrial tRNA genes were identified in <it>C. sinicus </it>on the basis of their potential cloverleaf structures and anti-codons (Table <tblr tid="T1">1</tblr>, Figure <figr fid="F5">5</figr>). Four tRNA genes overlap with one to five shared nucleotides. The most extreme overlap was identified at the junction between <it>trnY </it>and <it>trnE</it>. The overlapping portions may be repaired by a post-transcriptional editing process <abbrgrp>
<abbr bid="B36">36</abbr>
</abbrgrp>. Gene lengths (57 to 68 nucleotides) and anti-codon usage are comparable with those generally observed in arthropods. However, <it>trnK </it>and <it>trnS1 </it>(AGN) utilize TTT and TCT rather than the more common CTT and GCT. Such substitutions on wobble positions can be found in other invertebrate mitogenomes <abbrgrp>
<abbr bid="B37">37</abbr>
</abbrgrp>. As in other metazoans, anti-codons occasionally diverge from the most commonly used codons in degenerate codon families. For example, the most frequently used codon (AUA) for Met contradicts the corresponding anti-codon (AUG).</p>
<fig id="F5"><title><p>Figure 5</p></title><caption><p>Putative secondary structures of tRNAs in mitochondrial genome of <it>C. sinicus</it></p></caption><text>
   <p><b>Putative secondary structures of tRNAs in mitochondrial genome of <it>C. sinicus</it></b>. The tRNAs are labelled with the abbreviations of their corresponding amino acids. Each structural element is illustrated in <it>trnW </it>and <it>trnY</it>. Canonical cloverleaf structures are assumed in all tRNAs, with the exception of <it>trnS2 </it>(UCN).</p>
</text><graphic file="1471-2164-12-73-5" hint_layout="double"/></fig>
<p>Complete cloverleaf structures containing the T&#936;C stem (mostly 3-5 bp) and loop (3-7 nt), variable loop, anti-codon stem (5 bp) and loop (7 nt), DHU stem (mostly 3-4 bp) and loop (highly variable from 3 to 10 nt), and the acceptor stem (7 bp), can be predicted for 19 tRNAs, whereas the DHU arm is absent in <it>trnS2 </it>(UCN). In addition, the DHU arm of the other serine tRNA gene, <it>trnS1 </it>(AGN), is highly reduced, leaving a short stem (2 nt) and a small loop (3 nt). Degenerate or unpaired DHU arms in <it>trnS </it>are considered to be a common condition in arthropods <abbrgrp>
<abbr bid="B38">38</abbr>
</abbrgrp>, and particularly in copepods <abbrgrp>
<abbr bid="B18">18</abbr>
<abbr bid="B20">20</abbr>
</abbrgrp>. As for other arthropods, the anti-codon is preceded by a uracil and followed by a purine in <it>C. sinicus</it>. Deviating from the canonical mitochondrial tRNAs with four nucleotides in the variable loops, 5 nt variable loops were identified in <it>trnI </it>and <it>trnS1 </it>(AGN), and 3 nt variable loops were identified in <it>trnS2 </it>(UCN).</p>
</sec>
<sec>
<st>
<p>Non-coding sequences</p>
</st>
<p>Within the fragment determined, non-coding sequences totalled 6,270 bp (approximately 30% of the complete sequence) and were distributed among 23 intergenic regions. Six long non-coding regions larger than 100 bp were identified between <it>atp6 </it>and <it>rrnL </it>(LNR1; &gt;2,959 bp; not sequenced completely), <it>cox1 </it>and <it>nad4L </it>(LNR2; 128 bp), <it>trnH </it>and <it>trnA </it>(LNR3; 1,770 bp), <it>nad3 </it>and <it>cox3 </it>(LNR4; 102 bp), <it>cox3 </it>and <it>nad4 </it>(LNR5; 762 bp), and <it>nad2 </it>and <it>atp8 </it>(LNR6; 172 bp). Six additional non-coding regions larger than 30 bp were discovered. The mitogenome of <it>C. sinicus </it>is one of the largest among arthropods owing to the prevalence and enlargement of non-coding regions. The co-occurrence of numerous large non-coding regions is unusual <abbrgrp>
<abbr bid="B39">39</abbr>
</abbrgrp>. Because of the deletional bias, large inactive regions tend to be eliminated from mitogenomes so that they become economical <abbrgrp>
<abbr bid="B40">40</abbr>
</abbrgrp>. Intergenic spacers are normally limited in number and size. As far as crustaceans are concerned, most mitochondrial genomes reported so far possess a single long non-coding region. Exceptions to this include <it>Speleonectes tulumensis</it>, <it>Hutchinsoniella macracantha </it>
<abbrgrp>
<abbr bid="B41">41</abbr>
</abbrgrp> and <it>Geothelphusa dehaani </it>
<abbrgrp>
<abbr bid="B34">34</abbr>
</abbrgrp>. The largest non-coding sequences other than CRs are usually smaller than 40 bp <abbrgrp>
<abbr bid="B34">34</abbr>
</abbrgrp>. To elucidate the origin of multiple non-coding regions, BLAST analysis was conducted on LNRs. With the exception of LNR2, in which a 26 bp stretch similar to other crustacean <it>rrnS </it>was screened, the BLAST analysis revealed that the LNRs of <it>C. sinicus </it>shared no significant similarities to any known sequences. Therefore, independent origins and evolutionary processes are likely to have given rise to the various non-coding regions.</p>
<p>Generally, large non-coding sequences act as control regions to initiate and/or regulate mitochondrial transcription and replication. However, the functions of multiple heterologous non-coding regions are difficult to predict. AT-richness is broadly accepted as a characteristic for the identification of CRs. However, exceptions have been reported <abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp>, and appear to be common in copepods. Of five copepods, three possess equal (<it>L. salmonis </it>
<abbrgrp>
<abbr bid="B20">20</abbr>
</abbrgrp>) or lower (<it>T. japonicus </it>
<abbrgrp>
<abbr bid="B18">18</abbr>
</abbrgrp> and <it>T. californicus </it>
<abbrgrp>
<abbr bid="B14">14</abbr>
</abbrgrp>) A + T-contents in their control regions. Relatively lower AT-contents are present in <it>C. sinicus </it>LNRs with the exception of LNR2 (68.0%). Although conserved sequence blocks (CSBs) are common in control regions of metazoans <abbrgrp>
<abbr bid="B26">26</abbr>
<abbr bid="B42">42</abbr>
</abbrgrp>, such conservative properties among copepods were not detected. Therefore, the control regions were screened on the basis of the secondary structure motifs.</p>
<p>Several secondary structure motifs commonly found in control regions of arthropods <abbrgrp>
<abbr bid="B42">42</abbr>
<abbr bid="B43">43</abbr>
<abbr bid="B44">44</abbr>
<abbr bid="B45">45</abbr>
</abbrgrp> were identified in LNR3 including: (1) a poly-T stretch 360 bp to the 3' end of LNR3; (2) a hairpin structure (Additional file <supplr sid="S3">3</supplr>) on the L-strand 140 bp downstream of the poly-T stretch; (3) conserved sequences at the lateral ends of the hairpin structure; and (4) a microsatellite locus following the hairpin structure with "AT" as the core repeat (14-28 repetitions). These motifs make LNR3 the most likely candidate for the mitochondrial control region. However, hairpin structures were identified in other LNRs, which could be related to the mode of regulation of replication and transcription. Considering the extreme complexity of the non-coding sequences in <it>C. sinicus</it>, more comparative and functional analyses are required to elucidate their exact roles during mitochondrial metabolism.</p>
<suppl id="S3">
<title>
<p>Additional file 3</p>
</title>
<text>
<p>
<b>Stem-loop structures in the putative control region of <it>C. sinicus</it>
</b>. Potential hairpin structures within the LNR3 between <it>trnH </it>and <it>trnA </it>were constructed using UNAfold. Conserved motifs in 3' and 5' flanking sequences are underlined. The depicted region corresponds to the complementary strand of 11522 -11708 bp in the submitted sequence.</p>
</text>
<file name="1471-2164-12-73-S3.PNG">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec>
<st>
<p>Mitochondrial gene order</p>
</st>
<p>In addition to the multiplication of LNRs, a notably shuffled gene order was present. Similar features were identified in a nonbilaterian species, <it>Nemateleotris magnifica </it>
<abbrgrp>
<abbr bid="B46">46</abbr>
</abbrgrp>, but are rare in bilaterians <abbrgrp>
<abbr bid="B10">10</abbr>
</abbrgrp>. Mitochondrial gene order rearrangements are common among crustaceans, particularly in copepods <abbrgrp>
<abbr bid="B19">19</abbr>
<abbr bid="B21">21</abbr>
<abbr bid="B23">23</abbr>
<abbr bid="B25">25</abbr>
</abbrgrp>. In the case of <it>C. sinicus</it>, the mitochondrial genome is heavily rearranged. Compared with other mitogenomes in the MITOME database (<url>http://www.mitome.info</url>), <it>C. sinicus </it>has a unique gene order. Comparison of the <it>C. sinicus </it>mitogenome to the ground pattern for arthropod mitogenomes <abbrgrp>
<abbr bid="B47">47</abbr>
</abbrgrp> (Figure <figr fid="F6">6</figr>) revealed that the mitogenome was reshuffled. Translocations were identified for all tRNA genes in the mitogenome of <it>C. sinicus</it>. Among 36 gene boundaries, only two, <it>atp6-atp8 </it>and <it>nad6</it>-<it>cytb</it>, were retained. Moreover, 12 genes (34.2%) have reversed their transcriptional polarity: <it>trnD</it>, <it>atp6</it>, <it>atp8</it>, <it>nad3</it>, <it>trnA </it>and <it>nad2 </it>inverted from the L-strand to the H-strand, whereas <it>trnF</it>, <it>trnP</it>, <it>nad1</it>, <it>trnK</it>, <it>rrnS </it>and <it>trnQ </it>were shifted from the H-strand to the L-strand.</p>
<fig id="F6"><title><p>Figure 6</p></title><caption><p>Comparison of the mitochondrial gene arrangements in copepods and the putative arthropod ground pattern</p></caption><text>
   <p><b>Comparison of the mitochondrial gene arrangements in copepods and the putative arthropod ground pattern</b>. All tRNA genes and control regions were excluded for clarity. Gene segments are abbreviated as described in the main text, but are not drawn to scale. All elements were transcribed from left to right, with the exception of those depicted by shaded boxes. The horizontal lines illustrate gene blocks that are present in the arthropod ground pattern. Large-scale gene rearrangements are apparent in copepods, with all genes reshuffled.</p>
</text><graphic file="1471-2164-12-73-6" hint_layout="double"/></fig>
<p>In accordance with the high rate of sequence divergence between lineages, scrambled gene orders were observed among copepods. Large-scale gene rearrangements were identified within the family Calanidae <abbrgrp>
<abbr bid="B23">23</abbr>
</abbrgrp>. Owing to the small size and special secondary structures, tRNA genes are more mobile, and account for most translocations in crustaceans <abbrgrp>
<abbr bid="B25">25</abbr>
<abbr bid="B34">34</abbr>
</abbrgrp>. To avoid confusion initiated by reversal translocations of tRNA genes, the discussion of gene order is restricted to the protein-coding and rRNA genes (Figure <figr fid="F6">6</figr>). Complete reshuffling can be found in all copepod mitogenomes, leading to a divergent pattern of gene order in this group.</p>
<p>Translocations involving protein coding or rRNA genes are rare in metazoans <abbrgrp>
<abbr bid="B10">10</abbr>
</abbrgrp>. Such dramatic rearrangement of genes in copepods challenges the view of conservation of mitochondrial gene order, as suggested by unusual gene translocations in molluscs <abbrgrp>
<abbr bid="B48">48</abbr>
<abbr bid="B49">49</abbr>
</abbrgrp>. When pairwise gene orders of copepods are compared, there are few common intervals (0 to 40) indicated by results from CREx <abbrgrp>
<abbr bid="B50">50</abbr>
</abbrgrp>; exceptions are two siphonostomatoid species belonging to the family Caligidae (Additional file <supplr sid="S4">4</supplr>). No conserved synteny is shared by the copepod samples studied here, questioning their homologous status. The similarities of gene order within the family Calanidae and between the orders were compared, with no significant differences being identified. Therefore, the phylogenetic signal may be diluted by frequent gene rearrangements within the lineage. The lack of unambiguous synapomorphic gene arrangements in copepods precludes their use in phylogenetic analysis concerning Copepoda.</p>
<suppl id="S4">
<title>
<p>Additional file 4</p>
</title>
<text>
<p>
<b>Pairwise comparison of mitochondrial gene orders among copepods</b>. tRNA genes were not included. Common intervals (above), defined as the number of shared gene blocks inside a block independent of their gene orders, were calculated for comparison.</p>
</text>
<file name="1471-2164-12-73-S4.XLS">
   <p>Click here for file</p>
</file>
</suppl>
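<p>The common-interval comparison described above can be illustrated with a minimal sketch. It is not the CREx implementation: it assumes linear, unsigned gene orders (strand and circularity are ignored), counts only gene sets of at least two genes that are contiguous in both orders, and excludes the complete gene set. The function name and the toy gene orders are hypothetical and do not reproduce the copepod arrangements.</p>

```python
def count_common_intervals(order_a, order_b):
    """Count gene sets (size 2 up to n - 1) that are contiguous in both orders.

    Assumption: both orders are linear and unsigned, and contain the same genes.
    """
    if sorted(order_a) != sorted(order_b):
        raise ValueError("both orders must contain the same genes")
    position_in_b = {gene: index for index, gene in enumerate(order_b)}
    n = len(order_a)
    count = 0
    for start in range(n):
        lowest = highest = position_in_b[order_a[start]]
        for end in range(start + 1, n):
            position = position_in_b[order_a[end]]
            lowest = min(lowest, position)
            highest = max(highest, position)
            # The block order_a[start..end] is contiguous in order_b exactly
            # when its positions there span as many slots as it has genes.
            is_contiguous = (highest - lowest) == (end - start)
            is_whole_set = (end - start + 1) == n
            if is_contiguous and not is_whole_set:
                count += 1
    return count


# Toy gene orders (hypothetical, for illustration only).
reference = ["cox1", "cox2", "atp8", "atp6"]
scrambled = ["cox2", "atp6", "cox1", "atp8"]
print(count_common_intervals(reference, reference))
print(count_common_intervals(reference, scrambled))
```

<p>Under these assumptions, a low count for a pair of gene orders indicates that few gene blocks have been retained between them, which is the pattern reported for most copepod pairs.</p>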
<p>With regard to the rearrangement of mitochondrial genes, two major categories of mechanisms have been advanced: (1) tandem duplication followed by random or non-random deletion of excess genes <abbrgrp>
<abbr bid="B51">51</abbr>
<abbr bid="B52">52</abbr>
</abbrgrp>; and (2) non-homologous recombination <abbrgrp>
<abbr bid="B53">53</abbr>
<abbr bid="B54">54</abbr>
</abbrgrp>. The first scenario is improbable given the presence of inversions and the absence of conserved synteny. Consequently, non-homologous recombination, which can produce both translocations and inversions, may have been involved. To date, there is no direct evidence of mitochondrial DNA recombination in copepods. However, new evidence supporting recombination is emerging in invertebrates including molluscs <abbrgrp>
<abbr bid="B55">55</abbr>
</abbrgrp>, nematodes <abbrgrp>
<abbr bid="B56">56</abbr>
</abbrgrp> and arthropods <abbrgrp>
<abbr bid="B35">35</abbr>
</abbrgrp>. Furthermore, the problematic <it>circular subgenomic fragment </it>identified in <it>C. sinicus </it>may provide additional insights concerning mitochondrial DNA recombination in copepods.</p>
</sec>
<sec>
<st>
<p>Technical problems during laboratory work</p>
</st>
<p>During the experiments, an 18.6 kb DNA sequence was amplified, with reverse complementarity in its 505 bp long lateral ends. Such a sequence, which represents a covalently closed circular DNA molecule, is normally taken as evidence that the mitochondrial genome has been sequenced completely. However, the 4.5 kb fragment at the 3' end of the sequence was incomplete, with several mitochondrial elements being absent (see Methods section for details). Extended sequences were successfully determined with step-out PCRs <abbrgrp>
<abbr bid="B57">57</abbr>
</abbrgrp>, and verified by Long-PCR. Therefore, the circular molecule was confirmed to be a sub-genomic fragment nested within the complete mtDNA (<it>circular subgenomic fragment</it>). The occurrence of the problematic <it>circular subgenomic fragment </it>could be explained by the following scenarios: First, the 4.5 kb fragment could be nuclear copies of mitochondrial fragments (<it>NUMT</it>s) <abbrgrp>
<abbr bid="B58">58</abbr>
</abbrgrp>. <it>NUMT</it>s are normally composed of fragmented copies shorter than 4 kb <abbrgrp>
<abbr bid="B59">59</abbr>
<abbr bid="B60">60</abbr>
</abbrgrp>. Pseudogenes are another well-known characteristic of <it>NUMT</it>s <abbrgrp>
<abbr bid="B60">60</abbr>
</abbrgrp>. However, sequences of the coding genes in this problematic fragment compare favourably with the counterparts obtained from the cDNA templates. Indel and nonsense mutations were not present. Therefore, because the sequences of coding genes in the 4.5 kb fragment are almost identical to those obtained from reverse transcriptase PCR (unpublished data), with only three substitutions in the 1372 nucleotides being compared, this first scenario is unlikely.</p>
<p>Second, the 4.5 kb fragment could be an artefact of PCR jumping when site-specific lesions exist or initial copies in the template are few <abbrgrp>
<abbr bid="B61">61</abbr>
</abbrgrp>. Fresh materials were used for the amplification to reduce the possibility of PCR jumping as breaks in the template would give rise to the bouncing of primers. Unfortunately, abnormal nucleotides or stable stem-and-loop structures in the unidentified regions may have acted in a similar manner to breaks, causing the extending primer to jump to another template during PCR.</p>
<p>Finally, the <it>circular subgenomic fragment </it>could be the product of mitochondrial DNA recombination. Recombination is normally absent in mitochondrial genomes of metazoans, but convincing evidence for this process has emerged <abbrgrp>
<abbr bid="B56">56</abbr>
<abbr bid="B58">58</abbr>
<abbr bid="B62">62</abbr>
<abbr bid="B63">63</abbr>
</abbrgrp>. A defining feature of recombination is the breakage and rejoining of participating DNA strands <abbrgrp>
<abbr bid="B53">53</abbr>
</abbrgrp>. According to Lunt's recombination model, subgenomic circular molecules with the same gene organizations but smaller in size could be generated during recombination. The erroneous fragment mentioned above is consistent with Lunt's model <abbrgrp>
<abbr bid="B53">53</abbr>
</abbrgrp>, suggesting that recombination occurred. The results highlight the possibility of mitochondrial DNA recombination in copepods.</p>
<p>Such puzzles may be common to all copepod studies and caution should be applied when using long PCR technology to define complete mitochondrial genomes. Additional long PCRs are required to confirm whether the mitochondrial genome sequence is complete.</p>
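<p>The end-overlap check mentioned at the start of this section (reverse complementarity in the lateral ends of the amplified sequence) can be sketched as follows. This is an illustration only, not the procedure used in the study: the function names and the toy sequence are hypothetical, and an exact match is required, whereas real data would normally be compared by alignment with some tolerance for mismatches.</p>

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence (other symbols are kept)."""
    return sequence.translate(COMPLEMENT)[::-1]


def ends_are_reverse_complementary(sequence, end_length):
    """True when the last end_length bases are the reverse complement of the first.

    Assumption: exact matching, case-insensitive, with non-overlapping ends.
    """
    if 1 > end_length or 2 * end_length > len(sequence):
        raise ValueError("end_length must be positive and at most half the sequence")
    head = sequence[:end_length].upper()
    tail = sequence[-end_length:].upper()
    return tail == reverse_complement(head)


# Toy sequence (hypothetical): a 5 bp head, a spacer, and the head's reverse complement.
toy = "ACGGT" + "TTTTTTTT" + "ACCGT"
print(ends_are_reverse_complementary(toy, 5))
```

<p>As the text above stresses, a positive result of this kind does not by itself show that the mitochondrial genome is complete; additional long PCRs are still required.</p>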
</sec>
<sec>
<st>
<p>Phylogenetic analysis</p>
</st>
<p>Homogeneity of the stationary frequencies across the tree is a baseline assumption of current phylogenetic models. Therefore, amino acid alignments were used for inference of phylogeny, as amino acid content is more homogeneous among different lineages than nucleotide content <abbrgrp>
<abbr bid="B64">64</abbr>
</abbrgrp>. As presented in Figures <figr fid="F7">7</figr> and <figr fid="F8">8</figr>, monophyly of Pancrustacea and most of its high-level subtaxa including the classes Collembola, Diplura, Insecta and Malacostraca, and the subclasses Copepoda and Cirripedia, were supported irrespective of the model and method applied. Bootstrap support values (BP) from maximum likelihood (ML) analyses were usually lower in the current analysis, suggesting that the phylogenetic signal was weak or that a competing artificial signal was present. ML analyses are believed to be vulnerable to several factors including lineage-specific evolutionary rate heterogeneities and nucleotide compositional heterogeneities, which can impede the recovery of phylogenetic signals <abbrgrp>
<abbr bid="B16">16</abbr>
<abbr bid="B64">64</abbr>
</abbrgrp>. Hence, this study concentrated on the topology recovered by Bayesian inference (BI), but included ML trees.</p>
<fig id="F7"><title><p>Figure 7</p></title><caption><p>Phylogenetic relationships of major pancrustacean lineages inferred from concatenated amino acids of 12 mitochondrial protein coding genes (original dataset)</p></caption><text>
   <p><b>Phylogenetic relationships of major pancrustacean lineages inferred from concatenated amino acids of 12 mitochondrial protein coding genes (original dataset)</b>. Two Chelicerata and three Myriapoda species were used as out-groups. The topology shown is the result obtained from PhyloBayes under the CAT + mtArt model. Each group of three numbers at the branch nodes (clockwise) refers to the Bayesian posterior probabilities using PhyloBayes and MrBayes, and bootstrap support values using PHYML. The scale bar represents substitutions per site. '-' indicates the node was not supported by the corresponding analysis.</p>
</text><graphic file="1471-2164-12-73-7" hint_layout="double"/></fig>
<fig id="F8"><title><p>Figure 8</p></title><caption><p>The position of Copepoda and phylogenetic relationship of pancrustacean lineages are sensitive to the methods selected</p></caption><text>
   <p><b>The position of Copepoda and phylogenetic relationship of pancrustacean lineages are sensitive to the methods selected</b>. Analyses were performed on three datasets: (1) with complete amino acid alignments (Ori); (2) with only sites carrying a moderate evolutionary rate (Mod); (3) with strand-biased proteins excluded (Bal) under the mtArt (ART) or the CAT + mtArt (CAT) models. (A) Ori dataset under the ART model; (B) Ori dataset under the CAT model; (C) Mod dataset under the ART model; (D) Mod dataset under the CAT model; (E) Bal dataset under the ART model and (F) Bal dataset under the CAT model. A schematic version of the Bayesian trees is presented with a number of well-supported lineages collapsed for clarity.</p>
</text><graphic file="1471-2164-12-73-8" hint_layout="double"/></fig>
<p>Monophyly of copepods was well resolved in the results and in a former mitochondrial phylogenetic inference with expanded out-group sampling (Additional file <supplr sid="S5">5</supplr>). Consensus has been reached for the monophyly of Copepoda on the basis of morphological and molecular evidence <abbrgrp>
<abbr bid="B3">3</abbr>
<abbr bid="B4">4</abbr>
<abbr bid="B65">65</abbr>
</abbrgrp>, whereas the phylogenetic relationships among component orders are still controversial. As far as the phylogenetic relationships among copepods are concerned, congruent results were obtained from different analyses during the current research (Figure <figr fid="F9">9</figr>). Harpacticoida (<it>T. japonicus </it>) and monophyletic Siphonostomatoida ( <it>L. salmonis </it>and <it>C. clemensi</it>) grouped together, with the cluster containing Poecilostomatoida ( <it>S. polycolpus</it>) and Cyclopoida ( <it>P. nana </it>) as their sister clade. The grouping of the four orders, excluding Calanoida, confirms the monophyly of Podoplea, which is characteristically tagged by the podoplean tagmosis <abbrgrp>
<abbr bid="B3">3</abbr>
</abbrgrp>. The basal splitting of Copepoda separated Calanoida from Podoplea, reflecting the primary status of Calanoida within copepods. With regard to inter-ordinal phylogenetic relationships within Podoplea, Huys et al. proposed an early split between MCG-Clade (Misophrioida, Cyclopoida and Gelyelloida) and MHPSM-Clade (Mormonilloida, Harpacticoida, Poecilostomatoida, Siphonostomatoida and Monstrilloida), where Poecilostomatoida and Cyclopoida separated into distinct lineages soon after Podoplea was formed <abbrgrp>
<abbr bid="B3">3</abbr>
<abbr bid="B4">4</abbr>
<abbr bid="B6">6</abbr>
<abbr bid="B65">65</abbr>
</abbrgrp>. However, similar to the results from Huys, based on 18S rRNA <abbrgrp>
<abbr bid="B66">66</abbr>
</abbrgrp>, Cyclopoida and Poecilostomatoida exhibited closer affinity in this study, supporting Boxshall's hypothesis to reunite Poecilostomatoida into Cyclopoida. This view is gaining support from several independent analyses <abbrgrp>
<abbr bid="B7">7</abbr>
<abbr bid="B67">67</abbr>
</abbrgrp>. Accordingly, the present results support the hypothesis (outgroups, (Calanoida, ((Cyclopoida, Poecilostomatoida), (Harpacticoida, Siphonostomatoida)))).</p>
<suppl id="S5">
<title>
<p>Additional file 5</p>
</title>
<text>
<p>
<b>Phylogenetic tree presenting the monophyly of Copepoda and its position within arthropods inferred from concatenated amino acids of 12 mitochondrial protein coding genes</b>. Tree topologies produced by ML (mtArt model under PHYML) and BI (mtArt model under MrBayes) were compatible. Numbers at the branch nodes refer to Bayesian posterior probabilities and bootstrap support values from left to right.</p>
</text>
<file name="1471-2164-12-73-S5.PNG">
   <p>Click here for file</p>
</file>
</suppl>
<fig id="F9"><title><p>Figure 9</p></title><caption><p>Phylogenetic relationship of five component orders belonging to the subclass Copepoda based on Bayesian analysis</p></caption><text>
   <p><b>Phylogenetic relationship of five component orders belonging to the subclass Copepoda based on Bayesian analysis</b>. Two Branchiopoda and two Maxillopoda species were used as out-groups. Trees from various analyses are of the same topology. Only the tree inferred from original dataset using the model of CAT + mtArt is shown. Each group of four numbers at the branch nodes (clockwise) represents the Bayesian posterior probabilities of four analyses, Ori_ART, Ori_CAT, Bal_ART and Bal_CAT. The names were abbreviated as depicted in Figure 8.</p>
</text><graphic file="1471-2164-12-73-9" hint_layout="single"/></fig>
<p>The position of Copepoda within Pancrustacea is still unclear; the present analyses produced conflicting results using different methods. The uncertainty regarding the position of Copepoda within Pancrustacea is in part due to heterogeneity in evolutionary rates and nucleotide compositions within the lineage <abbrgrp>
<abbr bid="B64">64</abbr>
</abbrgrp>, and may be exacerbated by the derived nature of copepod mitochondrial sequences. Considering that copepods possess notably biased nucleotide compositional profiles, the recovery of the phylogenetic signal may be impeded by lineage-specific compositional heterogeneity. However, exclusion of strand-biased amino acids does not change the relative position of Copepoda (Figure <figr fid="F8">8</figr>), indicating that heterogeneous nucleotide composition may not play a key role in misleading phylogenetic analysis. Nevertheless, accelerated substitution rates of copepods do mask and erode phylogenetic signals by attracting long-branched taxa together. One example concerns the previously well-accepted monophyletic group Branchiopoda <abbrgrp>
<abbr bid="B27">27</abbr>
<abbr bid="B65">65</abbr>
</abbrgrp>, which was resolved polyphyletically by clustering <it>A. franciscana </it>with copepods in the original and balanced dataset under the mtArt model. However, analyses performed with only characters carrying a moderate evolutionary rate or with the CAT + mtArt, which has been confirmed as an effective model to overcome the effects of Long Branch Attraction (LBA) <abbrgrp>
<abbr bid="B68">68</abbr>
</abbrgrp>, consistently resolved a monophyletic Branchiopoda clade. Consequently, a possible LBA artefact could be introduced by the accelerated rate of evolution of the mitochondrial genomes of the sampled copepods and branchiopod. Therefore, the clustering of <it>A. franciscana </it>and copepods was regarded as a phylogenetic artefact due to the LBA rather than a sister grouping.</p>
<p>As for the position of Copepoda, four possible sister groups have been proposed in the present study (Figure <figr fid="F8">8</figr>): (1) Oligostraca including Ostracoda, Pentastomida, Branchiura <abbrgrp>
<abbr bid="B69">69</abbr>
</abbrgrp>, (2) Oligostraca plus Remipedia under the model of mtArt and (3) Branchiopoda, (4) Branchiopoda plus Malacostraca under the model of CAT + mtArt. It should be noted that by inspecting branch lengths, Copepoda, Oligostraca and Remipedia are rapidly evolving lineages. A decrease in support (PP = 0.37) for the close affinity of Oligostraca and Copepoda was observed in mtArt trees with the moderate-rate sites. Therefore, their grouping may be misleading owing to the artefact concerning the LBA, as the mtArt model is vulnerable to LBA artefacts. Consequently, although only moderately supported (PP from 0.66 to 0.76), the results obtained with the CAT + mtArt model, in which Copepoda was clustered in the monophyletic clade Vericrustacea, which joins Malacostraca, Branchiopoda, Thecostraca and Copepoda, are accepted in the present study. These results are compatible with a previous phylogenetic analysis based on nuclear protein-coding sequences <abbrgrp>
<abbr bid="B69">69</abbr>
</abbrgrp>.</p>
<p>The relationships among pancrustacean lineages are highly unstable, but several interesting findings resolved using various methods are noted. A monophyletic origin of Pancrustacea (Figures <figr fid="F7">7</figr> and <figr fid="F8">8</figr>) is strongly supported by the current analyses, as recovered by a number of molecular studies on the basis of sequence data <abbrgrp>
<abbr bid="B16">16</abbr>
<abbr bid="B65">65</abbr>
<abbr bid="B70">70</abbr>
</abbrgrp> or mitochondrial gene orders <abbrgrp>
<abbr bid="B71">71</abbr>
</abbrgrp>. In accordance with other mitochondrial studies <abbrgrp>
<abbr bid="B72">72</abbr>
</abbrgrp>, monophyly of Hexapoda and Crustacea was rejected in the present study, although relationships among the component lineages remain unstable. However, it is noteworthy that the affinity of Insecta and Collembola was resolved under the model of CAT + mtArt, while the grouping of Collembola with Diplura was recovered under the model of mtArt using only moderate-rate sites. These results prevent the rejection of the monophyly of Hexapoda on the basis of mitochondrial genomic data alone. Contradictory conclusions from mitochondrial sequences and nuclear sequences <abbrgrp>
<abbr bid="B69">69</abbr>
<abbr bid="B70">70</abbr>
<abbr bid="B73">73</abbr>
</abbrgrp> mean that the monophyly of Hexapoda and that of Crustacea remain open questions. The traditional but controversial Maxillopoda was resolved as paraphyletic/polyphyletic in the present analyses, in which it was sub-divided into three groups: Copepoda, Cirripedia, and Pentastomida plus Branchiura. This division of Maxillopoda is in agreement with recent studies based on combined data of 18S rRNA and two mitochondrial markers <abbrgrp>
<abbr bid="B65">65</abbr>
</abbrgrp>, and nuclear protein-coding genes <abbrgrp>
<abbr bid="B69">69</abbr>
</abbrgrp>.</p>
</sec>
<sec>
<st>
<p>Intraspecific sequence variability and its utility for population genetic analysis</p>
</st>
<p>An alignment of 11 mitogenomes from four populations was scanned for loci suitable as molecular markers. No evidence for recombination in the mitogenome of <it>C. sinicus </it>was detected. Within the 16,670 bp alignment [GenBank: <ext-link ext-link-id="HQ619228" ext-link-type="gen">HQ619228</ext-link>-<ext-link ext-link-id="HQ619237" ext-link-type="gen">HQ619237</ext-link>] there were a total of 397 variable sites including 295 single nucleotide polymorphisms (SNPs) comprising 191 nucleotide substitutions and 104 insertion/deletion polymorphic sites, in addition to three microsatellite motifs. A sliding-window analysis was performed to map the distribution of variable sites among 11 individuals in DNAsp <abbrgrp>
<abbr bid="B74">74</abbr>
</abbrgrp> (Figure <figr fid="F10">10</figr>). The mean frequency of the variable sites was relatively low (approximately 0.024). LNR3, harbouring two microsatellite motifs, was the most variable, while <it>nad4L </it>was the most conserved, with no variable sites. The "hotspots" bearing the highest frequency of variable sites were bases 11216-12260 with 226 variable sites in 1045 bases (1 in 4.6), 1848-2235 with 22 variable sites in 388 bases (1 in 17.6), and 649-862 with 10 variable sites in 214 bases (1 in 21.4). The first corresponds to LNR3, and the others span LNR1 and the region upstream of <it>rrnL</it>. The <it>cox1 </it>gene is widely used as a molecular marker for analysis at the population level <abbrgrp>
<abbr bid="B13">13</abbr>
</abbrgrp>. However, a 752 bp stretch spanning 4016-4767, which is located in the gene of <it>cox1</it>, has no variable site. The results revealed the conserved nature of <it>cox1 </it>and eliminated the possibility of its utility in population genetics for <it>C. sinicus</it>. Constant phylogenetic signals, which distinguish haplotype groups, were detected in the hyper-variable regions (unpublished data), underpinning their utility for population analysis.</p>
<fig id="F10"><title><p>Figure 10</p></title><caption><p>Plot representation of the frequency of variable sites across the mitochondrial genome of <it>C. sinicus</it></p></caption><text>
   <p><b>Plot representation of the frequency of variable sites across the mitochondrial genome of <it>C. sinicus</it></b>. The sliding-window was 200 bp in length and slid 2 bp at a time. Eleven individuals from four populations were used to determine the intraspecific distribution pattern of the variable sites. The bar at the top illustrates the position and orientation of each gene. The shaded boxes on the x-axis indicate sections that were not covered during the comparison. The horizontal line indicates the mean value of the frequency.</p>
</text><graphic file="1471-2164-12-73-10" hint_layout="double"/></fig>
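<p>The sliding-window scan described for Figure 10 can be sketched in a few lines. The following is a minimal illustration and not the DnaSP implementation: the function name <it>variable_site_frequency</it> is hypothetical, and the choice to count any alignment column holding more than one distinct character (gaps included) as variable is an assumption made here for simplicity.</p>

```python
def variable_site_frequency(seqs, window=200, step=2):
    """Return (start, frequency) pairs for each window along an alignment.

    seqs: aligned sequences of equal length (gaps written as '-').
    Assumption: a column is 'variable' if it holds more than one
    distinct character, so indel columns are counted as well.
    """
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned to the same length")
    # 1 where the column is variable, 0 otherwise
    variable = [1 if len(set(col)) > 1 else 0 for col in zip(*seqs)]
    results = []
    # start positions are 0-based; only full-length windows are reported
    for start in range(0, length - window + 1, step):
        count = sum(variable[start:start + window])
        results.append((start, count / window))
    return results


if __name__ == "__main__":
    toy = ["AAAAAA", "AAATAA", "AA-AAA"]
    for start, freq in variable_site_frequency(toy, window=3, step=1):
        print(start, round(freq, 3))
```

<p>With the defaults (a 200 bp window moved 2 bp at a time) the function mirrors the window settings given in the legend of Figure 10. Regions not covered in all individuals would need to be masked beforehand.</p>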
<p>Non-coding regions are the most variable, followed by PCGs and rRNA genes, with tRNA genes being the most conserved (Table <tblr tid="T2">2</tblr>). No deletions or insertions were identified in the coding regions. Of the 191 nucleotide substitutions, 50.2% were identified as parsimony informative. A distinct bias for nucleotide transitions over transversions was evident, with 87% of substitutions being transitions.</p>
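<p>For readers who want to reproduce a transition/transversion tally from their own list of substitutions, a minimal sketch follows. The helper names are hypothetical and the example pairs are toy data, not the substitutions observed in <it>C. sinicus</it>.</p>

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_substitution(a, b):
    """Classify a base change as 'transition' or 'transversion'."""
    a, b = a.upper(), b.upper()
    if a not in "ACGT" or b not in "ACGT" or len(a) != 1 or len(b) != 1:
        raise ValueError("bases must be single letters from A, C, G, T")
    if a == b:
        raise ValueError("identical bases are not a substitution")
    # purine to purine or pyrimidine to pyrimidine is a transition
    if {a, b}.issubset(PURINES) or {a, b}.issubset(PYRIMIDINES):
        return "transition"
    return "transversion"


def transition_fraction(pairs):
    """Fraction of substitutions in pairs that are transitions."""
    kinds = [classify_substitution(a, b) for a, b in pairs]
    return kinds.count("transition") / len(kinds)


if __name__ == "__main__":
    toy_pairs = [("A", "G"), ("C", "T"), ("A", "T"), ("G", "A")]
    print(transition_fraction(toy_pairs))
```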
<p>Of the 55 SNPs in PCGs, only 18 can give rise to amino acid substitutions. The majority of the SNPs (62.1%) are located in wobble codon positions. Only 14.0% were identified in second positions, all of which lead to amino acid substitutions. Among PCGs, <it>nad6 </it>is most divergent (0.00333). As presented in Table <tblr tid="T2">2</tblr>, intraspecific dN/dS ratios were either zero or at relatively high levels, ranging from 0.157 for <it>atp6 </it>to 0.914 for <it>nad4</it>. The overall intraspecific dN/dS ratio was 4.14 times larger than the counterpart between species, which is compatible with a nearly neutral model in which most amino acid substitutions are slightly deleterious <abbrgrp>
<abbr bid="B75">75</abbr>
</abbrgrp>.</p>
<p>In terms of the 11 nucleotide substitutions in structural RNA genes, seven were situated in stems and the remainder in loops. Three substitutions in stems induced deterioration of stem connections. However, they were all present at lateral margins, indicating little influence on the overall secondary structures. As mentioned above, such mis-pairing is considered to be restored during transcription.</p>
<p>In addition to substitution and indel variations, three microsatellite motifs were identified at regions from bases 11492 to 11504, 11645 to 11756 and 13066 to 13095. The "TCC" unit is the core repeat of the first motif, while "TA" is the core repeat of the others. The second microsatellite is the most variable, with repeat numbers ranging from 10 to 56. With additional subtle sequence variations within the motif, the 11 sequenced microsatellite loci can be sub-divided into seven alleles.</p>
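<p>Repeat numbers at such loci can be counted directly from sequence. The sketch below is an illustration with a hypothetical helper, <it>longest_tandem_repeat</it>, which assumes perfect, uninterrupted repeats; imperfect motifs such as those noted above would need a more tolerant approach. As a consistency check, the interval 11645 to 11756 spans 112 bases, which accommodates the maximum of 56 "TA" units reported.</p>

```python
import re


def longest_tandem_repeat(seq, unit):
    """Return (start, repeat_count) for the longest perfect run of unit.

    start is a 0-based index into seq; (-1, 0) is returned when the
    unit does not occur. Only uninterrupted repeats are considered.
    """
    pattern = re.compile("(?:%s)+" % re.escape(unit))
    best_start, best_count = -1, 0
    for match in pattern.finditer(seq):
        count = (match.end() - match.start()) // len(unit)
        if count > best_count:
            best_start, best_count = match.start(), count
    return best_start, best_count


if __name__ == "__main__":
    # toy sequence, not taken from the C. sinicus mitogenome
    print(longest_tandem_repeat("GGTATATATACCTATA", "TA"))
```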
</sec>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>This study presents the first nearly complete mitochondrial genome of a Calanoida species. The <it>circular subgenomic fragment </it>obtained calls for caution when analyzing the mitogenome of copepods using long PCR technology, and may offer additional evidence for mitochondrial recombination.</p>
<p>Although the contents and lengths of individual genes are similar to other arthropods, the mitogenome of <it>C. sinicus</it>, one of the largest mtDNAs in crustaceans, is enlarged by the prevalence and extended length of non-coding regions. The concurrence of multiple non-coding regions and reshuffled gene arrangement results in the mitochondrial genome of <it>C. sinicus </it>being remarkably distinctive from other arthropods. Mitochondrial DNA recombination may have played an important role in shaping the present mitochondrial profile of <it>C. sinicus</it>. The lack of synapomorphic gene arrangements among copepods raises questions concerning the use of gene order as a useful molecular marker for deep phylogenetic analysis.</p>
<p>Recovery of the phylogenetic signal in mitochondrial genomes may be affected by a variety of reconstruction artefacts, including lineage-specific heterogeneities in nucleotide composition and evolutionary rate. In particular, the LBA artefact influenced the results during analysis. Several methods were designed to reduce the dilution effect of the reconstruction artefacts. Although unstable, some encouraging congruent results were noticed in the analyses. Monophyly of copepods and the basal split between Calanoida and Podoplea were successfully resolved. The close affinity between Cyclopoida and Poecilostomatoida in the present study supports Boxshall in reassigning the latter subordinate to the former. Copepoda was clustered within the monophyletic clade Vericrustacea, although relationships among the lineages remain ambiguous. Falsification of Maxillopoda was confirmed by unveiling its paraphyletic/polyphyletic nature. However, owing to the limited phylogenetic signals in the mitochondrial data sets, no consensus concerning relationships among pancrustacean lineages was reached.</p>
<p>Within the 16,670 bp alignment there are a total of 397 variable sites. Indel variations are present in non-coding regions and transitions dominate the nucleotide substitutions. Three "hot-spots", particularly the hyper-variable microsatellite locus in LNRs, provide rich polymorphisms for population studies.</p>
</sec>
<sec>
<st>
<p>Methods</p>
</st>
<sec>
<st>
<p>Sample Collection and DNA Extraction</p>
</st>
<p>
<it>C. sinicus </it>for mitogenome characterization were collected from the Yellow Sea (35.9N, 122.9E) with a 500 &#956;m mesh zooplankton net during a summer cruise in 2006. The samples were preserved at -80&#176;C pending DNA extraction. To compare the intraspecific polymorphism pattern of different loci among populations, <it>C. sinicus </it>from Yangtze estuary (28.6N, 122.1E; 4 individuals), North Yellow Sea (38.7N, 120.7E; 3 individuals) and Korea (36.9N, 126.3E; 3 individuals) were selected.</p>
<p>Fifty individuals of <it>C. sinicus </it>from the same population (the Yellow Sea) were pooled to prepare a DNA template for mitogenome sequencing. To avoid the potential influence of nuclear DNA sequences of mitochondrial origin, crude mitochondria were roughly separated from cell debris and nuclei using differential centrifugation with a commercial tissue mitochondria isolation kit (Beyotime, C3606). mtDNA was extracted using the QIAamp DNA Micro Kit (Qiagen, Valencia, CA) following the manufacturer's instructions. For intraspecific comparison, genomic DNA was extracted from each individual separately.</p>
</sec>
<sec>
<st>
<p>PCR amplification and sequencing</p>
</st>
<p>Partial sequences of the genes <it>atp6</it>, <it>cytb, nad4 </it>and <it>rrnS </it>were determined using the primers presented in Additional file <supplr sid="S6">6</supplr>. On the basis of the sequence data obtained, long PCR primers (Additional file <supplr sid="S6">6</supplr>) were designed to amplify the entire <it>C. sinicus </it>mitogenome. Two PCR fragments with lengths of 3.8 kb and 11 kb were successfully amplified with the combination of primer cs_<it>cytb</it>F1 plus cs_16sR1 and primer <it>cytb</it>f3 plus cs_<it>nad4</it>f. PCR reactions were performed using a Mastercycler Pro gradient machine (Eppendorf) in a 50 &#956;l system containing 30 pmol of each primer, 3 nmol of each dNTP, 1.5 units of LA <it>taq </it>polymerase and approximately 20 ng of mtDNA template in 1X LA Taq buffer supplied by Takara. The cycle profile was initiated with a denaturation step of 94&#176;C for 3 min, followed by 33 cycles of 95&#176;C for 20 s, 58&#176;C for 30 s and 68&#176;C for 1 min per kb, and terminated with a final extension step of 72&#176;C for 10 min. The product was purified with an E.Z.N.A. gel extraction kit (Omega) and sequenced directly by the primer walking approach (Additional file <supplr sid="S6">6</supplr>).</p>
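<p>The reaction set-up above reduces to simple arithmetic. As a hedged sketch with hypothetical helper names, the per-cycle extension time at 1 min per kb and the final concentration implied by an amount in a 50 &#956;l reaction can be computed as follows (1 pmol per &#956;l equals 1 &#956;M).</p>

```python
def extension_seconds(amplicon_kb, seconds_per_kb=60):
    """Extension time per cycle for an amplicon, at a fixed rate per kb."""
    return amplicon_kb * seconds_per_kb


def final_concentration_um(amount_pmol, volume_ul):
    """Concentration in micromolar; pmol per microlitre equals micromolar."""
    return amount_pmol / volume_ul


if __name__ == "__main__":
    # fragment sizes and amounts as stated in the text
    for size_kb in (3.8, 11):
        print(size_kb, "kb:", round(extension_seconds(size_kb)), "s per cycle")
    print("primer:", final_concentration_um(30, 50), "micromolar")
    print("each dNTP:", final_concentration_um(3000, 50), "micromolar")
```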
<suppl id="S6">
<title>
<p>Additional file 6</p>
</title>
<text>
<p>
<b>List of primers used to determine the mitogenome of <it>C. sinicus</it>
</b>. Numbers refer to the nucleotide positions of 5' end of primers.</p>
</text>
<file name="1471-2164-12-73-S6.XLS">
   <p>Click here for file</p>
</file>
</suppl>
<p>Additional outward-facing primers were designed from both ends of the contig. A 4.5 kb fragment, with which all contigs could be circularized, was amplified. However, the absence of <it>atp6 </it>in the amplicon made the result unreliable. The fragment was confirmed to be spurious by the failure of amplification with primers from the dubious regions.</p>
<p>Step-out PCR techniques <abbrgrp>
<abbr bid="B57">57</abbr>
</abbrgrp> were applied to the remaining mitogenome in two directions, with the primers targeting lateral margins. Despite repeated attempts, the amplification terminated in certain regions on both sides. PCR products smaller than 5 kb were sequenced directly. Some short PCR fragments that could not be sequenced directly were cloned into the pMD18-T vector (Takara) before sequencing. The 11 kb PCR product was sheared into small fragments of 1-3 kb using HydroShear (Genomic Solutions), and then cloned into the pUC19 vector (Fermentas) after being end-repaired with T4 DNA polymerase (NEB) following the manufacturer's protocol. Forty clones were sequenced on an ABI 3730 sequencer by Biosune (Shanghai).</p>
<p>On the basis of the mitogenome sequences obtained, four primer combinations from relatively conserved regions were designed for screening polymorphic loci in the <it>C. sinicus </it>mitogenome. Fragments of 1.1 kb to 9.5 kb in length from 10 individuals were sequenced using the methods described above. Base calling was performed with phred, and the reads were assembled in phrap with default parameters <abbrgrp>
<abbr bid="B76">76</abbr>
<abbr bid="B77">77</abbr>
</abbrgrp>. All assembled sequences were manually verified with the aid of CONSED to remove misassemblies <abbrgrp>
<abbr bid="B78">78</abbr>
</abbrgrp>. The nearly complete mitochondrial genome of <it>C. sinicus </it>has been deposited in GenBank with the accession number [GenBank: <ext-link ext-link-id="GU355641" ext-link-type="gen">GU355641</ext-link>].</p>
</sec>
<sec>
<st>
<p>Sequence analysis and annotation</p>
</st>
<p>DNA sequences were analyzed using the software package Lasergene ver. 7.1.0 (DNASTAR, Inc., Madison) and Vector NTI Advance 9 (Invitrogen, Carlsbad, CA). Locations of protein coding genes were preliminarily identified by the ORF-finding method from GeneQuest, followed by BLAST searching on GenBank datasets. The locations were refined using multiple alignments to other crustacean nucleotide sequences. tRNA genes were identified by their proposed cloverleaf secondary structure and anti-codon sequences using tRNAscan-SE 1.21 <abbrgrp>
<abbr bid="B79">79</abbr>
</abbrgrp> and ARWEN <abbrgrp>
<abbr bid="B80">80</abbr>
</abbrgrp> with relaxed settings, and confirmed manually. Two rRNA genes were determined by comparison with other annotated crustacean mitogenomes, and reconfirmed using their secondary structures. Inferred rRNA sequences were aligned with those of other crustaceans whose secondary structures are available (<it>rrnS </it>and <it>rrnL </it>obtained from the rRNA database: <url>http://www.psb.ugent.be/rRNA/index.html</url>) by means of the program DCSE <abbrgrp>
<abbr bid="B81">81</abbr>
</abbrgrp>. The program RnaViz 2 <abbrgrp>
<abbr bid="B82">82</abbr>
</abbrgrp> was used to draw secondary structures of tRNAs and rRNAs. Secondary structures of the putative control region, according to the model for arthropods <abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp>, were estimated using UNAfold <abbrgrp>
<abbr bid="B83">83</abbr>
</abbrgrp>. Gene divergence and synonymous and non-synonymous substitution rates in the protein coding genes were calculated with DnaSP 4.0 and PAML 4.3 <abbrgrp>
<abbr bid="B84">84</abbr>
</abbrgrp>.</p>
</sec>
<sec>
<st>
<p>Sequence alignment and phylogenetic analysis</p>
</st>
<p>In addition to the complete mitochondrial genome of <it>C. sinicus </it>presented here, mitogenome sequences of another 105 arthropods were retrieved from GenBank. Information concerning phylogenetic position, gene order, nucleotides and amino acids of individual genes, and sizes of mitogenomes, was extracted from the combined datasets using purpose-built perl scripts (Additional file <supplr sid="S7">7</supplr>). To avoid artefacts due to asymmetric nucleotide composition, the nucleotide contents of the concatenated PCG sequences from the initial dataset were compared. Principal components analysis (PCA) was performed with the contents of the component nucleotides as variables (Additional file <supplr sid="S8">8</supplr>). The results were used as a guide for sampling taxa with relatively homogeneous nucleotide compositions. The nucleotide composition and strand asymmetry of some maxillopods are not as balanced, but they were included for complete taxon coverage. A sample containing 36 species including three copepods was selected. Amino acid sequences of individual proteins were aligned using Probalign under the default settings for protein <abbrgrp>
<abbr bid="B85">85</abbr>
</abbrgrp>. <it>atp8 </it>was not included as it was absent from some taxa sampled. A dataset (Original dataset) of 2646 amino acids with posterior probabilities above five was accepted for the subsequent phylogenetic analysis. To explore the signal in the dataset and clarify the placement of Copepoda, two additional datasets were introduced. For the first dataset (Balanced dataset), only proteins whose nucleotide composition was not significantly strand-biased were included. Since too-rapidly and too-slowly evolving sites may affect phylogenetic analysis <abbrgrp>
<abbr bid="B64">64</abbr>
</abbrgrp>, another dataset (moderate-rate dataset) was constructed by removing classes of rapidly and slowly evolving sites using the slow-fast approach <abbrgrp>
<abbr bid="B64">64</abbr>
<abbr bid="B86">86</abbr>
</abbrgrp>, in which sites were partitioned into quartiles and only those from the two internal ones were accepted.</p>
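<p>The quartile filter behind the moderate-rate dataset can be illustrated with a short sketch. It assumes that a per-site rate estimate is already available for every alignment column; how those rates are obtained is part of the slow-fast approach and is not reproduced here. The function name and the toy rates are hypothetical.</p>

```python
def moderate_rate_sites(rates):
    """Return sorted indices of sites falling in the two middle quartiles.

    rates: one evolutionary-rate estimate per alignment site.
    Sites are ranked by rate; the slowest quarter and the fastest
    quarter are discarded (ties are broken by site order).
    """
    order = sorted(range(len(rates)), key=lambda i: rates[i])
    n = len(order)
    lower = n // 4        # first index kept after dropping the slowest quartile
    upper = n - n // 4    # end index, dropping the fastest quartile
    return sorted(order[lower:upper])


if __name__ == "__main__":
    toy_rates = [0.1, 2.0, 0.5, 0.9, 1.5, 0.0, 3.0, 0.7]
    print(moderate_rate_sites(toy_rates))
```

<p>The returned indices could then be used to subset the columns of each aligned protein before concatenation.</p>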
<suppl id="S7">
<title>
<p>Additional file 7</p>
</title>
<text>
<p>
<b>Perl scripts to extract mitochondrial genomic information from GenBank files</b>. Bioperl modules are required.</p>
</text>
<file name="1471-2164-12-73-S7.PL">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S8">
<title>
<p>Additional file 8</p>
</title>
<text>
<p>
<b>Nucleotide compositional properties of the candidates for phylogenetic analysis illustrated by principal components analysis (PCA)</b>. PCA ordination was based on the proportion of separate nucleotides. PC1 (principal component 1) explained 87% of the total variations and PC2 explained 10% of the total variations. Species were sampled predominantly inside the red ellipse.</p>
</text>
<file name="1471-2164-12-73-S8.PNG">
   <p>Click here for file</p>
</file>
</suppl>
<p>To understand phylogenetic relationships among copepods, two smaller datasets were built (Original and Balanced datasets). These datasets consisted of six copepods whose complete mitochondrial genomes have been (almost) entirely determined, in addition to two branchiopods and two maxillopods as out-groups. For the intraspecific sequence variability analysis, reads from another 10 individuals were assembled and manually aligned in BioEdit (North Carolina State University, NC) using the <it>C. sinicus </it>mitogenome as a template. Alignment was performed on individual genes with sequences from other copepods using Probalign to estimate sequence divergence of various loci.</p>
<p>According to preliminary analysis, the CAT + MtArt model and MtArt model fit the data best and were selected for further analysis. Bayesian analyses were carried out using MrBayes (MtArt model) and PhyloBayes (CAT + MtArt model), with among-site rate variation modelled by a gamma distribution with four discrete categories. Two independent MCMC chains were run simultaneously to assess whether the search had stabilized, and were stopped when the chains converged (maxdiff less than 0.2, but in most cases less than 0.1, for PhyloBayes; standard deviation [SD] of split frequencies lower than 0.01 for MrBayes). Otherwise, runs were continued until more than 5000 sample points were available per run. The ML analysis was carried out with PHYML 3.0 with 200 bootstrap replicates.</p>
</sec>
</sec>
<sec>
<st>
<p>Abbreviations</p>
</st>
<p>
<it>atp6 </it>and <it>8</it>: ATPase subunit 6 and 8; bp: base pair (s); BI: Bayesian inference; BP: Bootstrap; <it>cox1</it>-<it>3</it>: cytochrome c oxidase subunits I-III; <it>cytb</it>: cytochrome b; LBA: long-branch attraction; <it>rrnL</it>: 16S ribosomal RNA; LNR: large non-coding region; ML: maximum likelihood; mitogenome: mitochondrial genome; mtDNA: mitochondrial DNA; nt: nucleotide (s); <it>nad1</it>-<it>6 </it>and 4L: NADH dehydrogenase subunits 1-6 and 4L; ORF: open reading frame; PCG: protein coding gene; PCR: polymerase chain reaction; PP: Bayesian posterior probabilities; rRNA: ribosomal RNA; SNP: single nucleotide polymorphism; <it>rrnS</it>: 12S ribosomal RNA; tRNA: transfer RNA; <it>trnX </it>(where X is replaced by single letter amino acid code of the corresponding amino acid): tRNA gene.</p>
</sec>
<sec>
<st>
<p>Authors' contributions</p>
</st>
<p>MXW and SS contributed to the conception and design of the study. MXW and XS conducted the majority of the laboratory work and are responsible for the data analysis. SS and CLL supervised the study and provided technical support during experiments. SS, CLL and MXW collaborated on the writing of the manuscript. XS provided important advice on revision of the manuscript. All authors read and approved the final manuscript.</p>
</sec>
</bdy><bm>
<ack>
<sec>
<st>
<p>Acknowledgements</p>
</st>
<p>We thank Fangping Cheng and Shiwei Wang for their assistance with sample collection and species identification. We are grateful to Rencheng Wang for his kind support during the laboratory work. We appreciate the editors, Ann Bucklin, Sha Zhongli, Luan Weisha, Yu Haiyan and all the reviewers for their valuable comments. The English revision of the manuscript was made by BioMedEs. The work was supported by the Chinese Academy of Sciences (KZCX2-YW-Q07 and GJHZ200808), National Natural Science Foundation of China (40821004 and 40631008) and State Oceanic Administration of China (200805042).</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>How Many Copepods</p></title><aug><au><snm>Humes</snm><fnm>AG</fnm></au></aug><source>Hydrobiologia</source><pubdate>1994</pubdate><volume>293</volume><fpage>1</fpage><lpage>7</lpage><xrefbib><pubid idtype="doi">10.1007/BF00229916</pubid></xrefbib></bibl><bibl id="B2"><title><p>The Biology of Calanoid Copepods</p></title><aug><au><snm>Mauchline</snm><fnm>J</fnm></au></aug><publisher>Academic Press, London</publisher><pubdate>1998</pubdate><fpage>710</fpage></bibl><bibl id="B3"><title><p>Copepod evolution</p></title><aug><au><snm>Huys</snm><fnm>Rony</fnm></au><au><snm>Boxshall</snm><fnm>GA</fnm></au></aug><publisher>Ray Society</publisher><pubdate>1991</pubdate><volume>159</volume></bibl><bibl id="B4"><title><p>An updated classification of the recent Crustacea</p></title><aug><au><snm>Martin</snm><fnm>JW</fnm></au><au><snm>Davis</snm><fnm>GE</fnm></au></aug><source>History Museum of Los Angeles County: Los Angeles, CA (USA) VII</source><pubdate>2001</pubdate><fpage>123</fpage><note>
   <b>Science Series 39</b>
</note></bibl><bibl id="B5"><title><p>A propos du r&#233;pertoire mondial des Calano&#239;des des eaux continentales</p></title><aug><au><snm>Dussart</snm><fnm>BH</fnm></au></aug><source>Crustaceana</source><pubdate>1984</pubdate><fpage>25</fpage><lpage>31</lpage></bibl><bibl id="B6"><title><p>Copepod Phylogeny - a Reconsideration of Huys-and-Boxhall Parsimony Versus Homology</p></title><aug><au><snm>Ho</snm><fnm>JS</fnm></au></aug><source>Hydrobiologia</source><pubdate>1994</pubdate><volume>293</volume><fpage>31</fpage><lpage>39</lpage><xrefbib><pubid idtype="doi">10.1007/BF00229920</pubid></xrefbib></bibl><bibl id="B7"><title><p>An introduction to copepod diversity</p></title><aug><au><snm>Boxshall</snm><fnm>G</fnm></au><au><snm>Halsey</snm><fnm>S</fnm></au></aug><source>2004: Ray Soc</source><pubdate>2004</pubdate></bibl><bibl id="B8"><title><p>Higher-level crustacean phylogeny: Consensus and conflicting hypotheses</p></title><aug><au><snm>Jenner</snm><fnm>RA</fnm></au></aug><source>Arthropod Struct Dev</source><pubdate>2010</pubdate><volume>39</volume><issue>2-3</issue><fpage>143</fpage><lpage>153</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.asd.2009.11.001</pubid><pubid idtype="pmpid" link="fulltext">19944189</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Animal Mitochondrial-DNA - Structure and Evolution</p></title><aug><au><snm>Wolstenholme</snm><fnm>DR</fnm></au></aug><source>International Review of Cytology-a Survey of Cell Biology</source><pubdate>1992</pubdate><volume>141</volume><fpage>173</fpage><lpage>216</lpage></bibl><bibl id="B10"><title><p>Animal mitochondrial genomes</p></title><aug><au><snm>Boore</snm><fnm>JL</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>1999</pubdate><volume>27</volume><issue>8</issue><fpage>1767</fpage><lpage>1780</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/27.8.1767</pubid><pubid idtype="pmcid">148383</pubid><pubid 
idtype="pmpid">10101183</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Beyond linear sequence comparisons: the use of genome-level characters for phylogenetic reconstruction</p></title><aug><au><snm>Boore</snm><fnm>JL</fnm></au><au><snm>Fuerstenberg</snm><fnm>SI</fnm></au></aug><source>Philosophical transactions of the Royal Society of London</source><pubdate>2008</pubdate><volume>363</volume><issue>1496</issue><fpage>1445</fpage><lpage>1451</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1098/rstb.2007.2234</pubid><pubid idtype="pmcid">2614225</pubid><pubid idtype="pmpid">18192190</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Two mitochondrial lineages in Korean freshwater Corbicula (Corbiculidae: Bivalvia)</p></title><aug><au><snm>Park</snm><fnm>JK</fnm></au><au><snm>Choe</snm><fnm>BL</fnm></au><au><snm>Eom</snm><fnm>KS</fnm></au></aug><source>Molecules and Cells</source><pubdate>2004</pubdate><volume>17</volume><issue>3</issue><fpage>410</fpage><lpage>414</lpage><xrefbib><pubid idtype="pmpid">15232214</pubid></xrefbib></bibl><bibl id="B13"><title><p>Population structure of the planktonic copepod Calanus pacificus in the North Pacific Ocean</p></title><aug><au><snm>Nuwer</snm><fnm>M</fnm></au><au><snm>Frost</snm><fnm>B</fnm></au><au><snm>Armbrust</snm><fnm>EV</fnm></au></aug><source>Marine Biology</source><pubdate>2008</pubdate><volume>156</volume><issue>2</issue><fpage>107</fpage><lpage>115</lpage><xrefbib><pubid idtype="doi">10.1007/s00227-008-1068-y</pubid></xrefbib></bibl><bibl id="B14"><title><p>Three divergent mitochondrial genomes from California populations of the copepod Tigriopus californicus</p></title><aug><au><snm>Burton</snm><fnm>RS</fnm></au><au><snm>Byrne</snm><fnm>RJ</fnm></au><au><snm>Rawson</snm><fnm>PD</fnm></au></aug><source>Gene</source><pubdate>2007</pubdate><volume>403</volume><issue>1-2</issue><fpage>53</fpage><lpage>59</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.gene.2007.07.026</pubid><pubid 
idtype="pmpid" link="fulltext">17855023</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>Incorporating molecular evolution into phylogenetic analysis, and a new compilation of conserved polymerase chain reaction primers for animal mitochondrial DNA</p></title><aug><au><snm>Simon</snm><fnm>C</fnm></au><au><snm>Buckley</snm><fnm>TR</fnm></au><au><snm>Frati</snm><fnm>F</fnm></au><au><snm>Stewart</snm><fnm>JB</fnm></au><au><snm>Beckenbach</snm><fnm>AT</fnm></au></aug><source>Annual Review of Ecology Evolution and Systematics</source><pubdate>2006</pubdate><volume>37</volume><fpage>545</fpage><lpage>579</lpage><xrefbib><pubid idtype="doi">10.1146/annurev.ecolsys.37.091305.110018</pubid></xrefbib></bibl><bibl id="B16"><title><p>Phylogeny of Arthropoda inferred from mitochondrial sequences: Strategies for limiting the misleading effects of multiple changes in pattern and rates of substitution</p></title><aug><au><snm>Hassanin</snm><fnm>A</fnm></au></aug><source>Mol Phylogenet Evol</source><pubdate>2006</pubdate><volume>38</volume><issue>1</issue><fpage>100</fpage><lpage>116</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ympev.2005.09.012</pubid><pubid idtype="pmpid" link="fulltext">16290034</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>Genetic markers in blue crabs (Callinectes sapidus) II. 
Complete mitochondrial genome sequence and characterization of genetic variation</p></title><aug><au><snm>Place</snm><fnm>AR</fnm></au><au><snm>Feng</snm><fnm>XJ</fnm></au><au><snm>Steven</snm><fnm>CR</fnm></au><au><snm>Fourcade</snm><fnm>HM</fnm></au><au><snm>Boore</snm><fnm>JL</fnm></au></aug><source>Journal of Experimental Marine Biology and Ecology</source><pubdate>2005</pubdate><volume>319</volume><issue>1-2</issue><fpage>15</fpage><lpage>27</lpage><xrefbib><pubid idtype="doi">10.1016/j.jembe.2004.03.024</pubid></xrefbib></bibl><bibl id="B18"><title><p>Complete mitochondrial DNA sequence of Tigriopus japonicus (Crustacea: Copepoda)</p></title><aug><au><snm>Machida</snm><fnm>RJ</fnm></au><au><snm>Miya</snm><fnm>MU</fnm></au><au><snm>Nishida</snm><fnm>M</fnm></au><au><snm>Nishida</snm><fnm>S</fnm></au></aug><source>Marine Biotechnology</source><pubdate>2002</pubdate><volume>4</volume><issue>4</issue><fpage>406</fpage><lpage>417</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s10126-002-0033-x</pubid><pubid idtype="pmpid" link="fulltext">14961252</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>The complete mitochondrial genome of the intertidal copepod Tigriopus sp (Copepoda, Harpactidae) from Korea and phylogenetic considerations</p></title><aug><au><snm>Jung</snm><fnm>SO</fnm></au><au><snm>Lee</snm><fnm>YM</fnm></au><au><snm>Park</snm><fnm>TJ</fnm></au><au><snm>Park</snm><fnm>HG</fnm></au><au><snm>Hagiwara</snm><fnm>A</fnm></au><au><snm>Leung</snm><fnm>KMY</fnm></au><au><snm>Dahms</snm><fnm>HU</fnm></au><au><snm>Lee</snm><fnm>W</fnm></au><au><snm>Lee</snm><fnm>JS</fnm></au></aug><source>Journal of Experimental Marine Biology and Ecology</source><pubdate>2006</pubdate><volume>333</volume><issue>2</issue><fpage>251</fpage><lpage>262</lpage><xrefbib><pubid idtype="doi">10.1016/j.jembe.2005.12.047</pubid></xrefbib></bibl><bibl id="B20"><title><p>Genetic characterization of the mitochondrial DNA from Lepeophtheirus salmonis (Crustacea: Copepoda). 
A new gene organization revealed</p></title><aug><au><snm>Tjensvoll</snm><fnm>K</fnm></au><au><snm>Hodneland</snm><fnm>K</fnm></au><au><snm>Nilsen</snm><fnm>F</fnm></au><au><snm>Nylund</snm><fnm>A</fnm></au></aug><source>Gene</source><pubdate>2005</pubdate><volume>353</volume><issue>2</issue><fpage>218</fpage><lpage>230</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.gene.2005.04.033</pubid><pubid idtype="pmpid" link="fulltext">15987668</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>The complete mitochondrial genome of the cyclopoid copepod Paracyclopina nana: A highly divergent genome with novel gene order and atypical gene numbers</p></title><aug><au><snm>Ki</snm><fnm>JS</fnm></au><au><snm>Park</snm><fnm>HG</fnm></au><au><snm>Lee</snm><fnm>JS</fnm></au></aug><source>Gene</source><pubdate>2009</pubdate><volume>435</volume><issue>1-2</issue><fpage>13</fpage><lpage>22</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.gene.2009.01.005</pubid><pubid idtype="pmpid" link="fulltext">19393182</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>Why does Calanus sinicus prosper in the shelf ecosystem of the Northwest Pacific Ocean?</p></title><aug><au><snm>Uye</snm><fnm>S</fnm></au></aug><source>Ices J Mar Sci</source><pubdate>2000</pubdate><volume>57</volume><issue>6</issue><fpage>1850</fpage><lpage>1855</lpage><xrefbib><pubid idtype="doi">10.1006/jmsc.2000.0965</pubid></xrefbib></bibl><bibl id="B23"><title><p>Large-scale gene rearrangements in the mitochondrial genomes of two calanoid copepods Eucalanus bungii and Neocalanus cristatus (Crustacea), with notes on new versatile primers for the srRNA and COI genes</p></title><aug><au><snm>Machida</snm><fnm>RJ</fnm></au><au><snm>Miya</snm><fnm>MU</fnm></au><au><snm>Nishida</snm><fnm>M</fnm></au><au><snm>Nishida</snm><fnm>S</fnm></au></aug><source>Gene</source><pubdate>2004</pubdate><volume>332</volume><fpage>71</fpage><lpage>78</lpage><xrefbib><pubidlist><pubid 
idtype="doi">10.1016/j.gene.2004.01.019</pubid><pubid idtype="pmpid" link="fulltext">15145056</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>Evidence for multiple reversals of asymmetric mutational constraints during the evolution of the mitochondrial genome of metazoa, and consequences for phylogenetic inferences</p></title><aug><au><snm>Hassanin</snm><fnm>A</fnm></au><au><snm>Leger</snm><fnm>N</fnm></au><au><snm>Deutsch</snm><fnm>J</fnm></au></aug><source>Systematic biology</source><pubdate>2005</pubdate><volume>54</volume><issue>2</issue><fpage>277</fpage><lpage>298</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1080/10635150590947843</pubid><pubid idtype="pmpid" link="fulltext">16021696</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>The complete mitochondrial genome of the common sea slater, Ligia oceanica (Crustacea, Isopoda) bears a novel gene order and unusual control region features</p></title><aug><au><snm>Kilpert</snm><fnm>F</fnm></au><au><snm>Podsiadlowski</snm><fnm>L</fnm></au></aug><source>Bmc Genomics</source><pubdate>2006</pubdate><volume>7</volume><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-7-241</pubid><pubid idtype="pmcid">1590035</pubid><pubid idtype="pmpid">16987408</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Transcription and replication of mitochondrial DNA</p></title><aug><au><snm>Clayton</snm><fnm>DA</fnm></au></aug><source>Human reproduction (Oxford, England)</source><pubdate>2000</pubdate><volume>15</volume><issue>Suppl 2</issue><fpage>11</fpage><lpage>17</lpage><xrefbib><pubid idtype="pmpid">11041509</pubid></xrefbib></bibl><bibl id="B27"><title><p>The relationship between the rate of molecular evolution and the rate of genome rearrangement in animal mitochondrial genomes</p></title><aug><au><snm>Xu</snm><fnm>W</fnm></au><au><snm>Jameson</snm><fnm>D</fnm></au><au><snm>Tang</snm><fnm>B</fnm></au><au><snm>Higgs</snm><fnm>PG</fnm></au></aug><source>Journal of Molecular 
Evolution</source><pubdate>2006</pubdate><volume>63</volume><issue>3</issue><fpage>375</fpage><lpage>392</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00239-005-0246-5</pubid><pubid idtype="pmpid" link="fulltext">16838214</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>Mitochondrial genome data support the basal position of Acoelomorpha and the polyphyly of the Platyhelminthes</p></title><aug><au><snm>Ruiz-Trillo</snm><fnm>I</fnm></au><au><snm>Riutort</snm><fnm>M</fnm></au><au><snm>Fourcade</snm><fnm>HM</fnm></au><au><snm>Baguna</snm><fnm>J</fnm></au><au><snm>Boore</snm><fnm>JL</fnm></au></aug><source>Mol Phylogenet Evol</source><pubdate>2004</pubdate><volume>33</volume><issue>2</issue><fpage>321</fpage><lpage>332</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ympev.2004.06.002</pubid><pubid idtype="pmpid" link="fulltext">15336667</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>Collection of Small-Subunit (16s- and 16s-Like) Ribosomal-Rna Structures - 1994</p></title><aug><au><snm>Gutell</snm><fnm>RR</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>1994</pubdate><volume>22</volume><issue>17</issue><fpage>3502</fpage><lpage>3507</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/22.17.3502</pubid><pubid idtype="pmcid">308311</pubid><pubid idtype="pmpid">7524024</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Database on the structure of large subunit ribosomal RNA</p></title><aug><au><snm>De Rijk</snm><fnm>P</fnm></au><au><snm>Robbrecht</snm><fnm>E</fnm></au><au><snm>de Hoog</snm><fnm>S</fnm></au><au><snm>Caers</snm><fnm>A</fnm></au><au><snm>Van de Peer</snm><fnm>Y</fnm></au><au><snm>De Wachter</snm><fnm>R</fnm></au></aug><source>Nucleic Acids Research</source><pubdate>1999</pubdate><volume>27</volume><issue>1</issue><fpage>174</fpage><lpage>178</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/27.1.174</pubid><pubid idtype="pmcid">148127</pubid><pubid 
idtype="pmpid">9847172</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>The complete sequence of the mitochondrial genome of Daphnia pulex (Cladocera: Crustacea)</p></title><aug><au><snm>Crease</snm><fnm>TJ</fnm></au></aug><source>Gene</source><pubdate>1999</pubdate><volume>233</volume><issue>1-2</issue><fpage>89</fpage><lpage>99</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0378-1119(99)00151-1</pubid><pubid idtype="pmpid" link="fulltext">10375625</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>The European Small Subunit Ribosomal RNA database</p></title><aug><au><snm>Van de Peer</snm><fnm>Y</fnm></au><au><snm>De Rijk</snm><fnm>P</fnm></au><au><snm>Wuyts</snm><fnm>J</fnm></au><au><snm>Winkelmans</snm><fnm>T</fnm></au><au><snm>De Wachter</snm><fnm>R</fnm></au></aug><source>Nucleic Acids Research</source><pubdate>2000</pubdate><volume>28</volume><issue>1</issue><fpage>175</fpage><lpage>176</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/28.1.175</pubid><pubid idtype="pmcid">102429</pubid><pubid idtype="pmpid">10592217</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>The complete mitochondrial genome of the sexual oribatid mite Steganacarus magnus: genome rearrangements and loss of tRNAs</p></title><aug><au><snm>Domes</snm><fnm>K</fnm></au><au><snm>Maraun</snm><fnm>M</fnm></au><au><snm>Scheu</snm><fnm>S</fnm></au><au><snm>Cameron</snm><fnm>SL</fnm></au></aug><source>BMC Genomics</source><pubdate>2008</pubdate><volume>9</volume><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-9-532</pubid><pubid idtype="pmcid">2588462</pubid><pubid idtype="pmpid">18992147</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>The mitochondrial genome of the Japanese freshwater crab, Geothelphusa dehaani (Crustacea: Brachyura): Evidence for its evolution via gene 
duplication</p></title><aug><au><snm>Segawa</snm><fnm>RD</fnm></au><au><snm>Aotsuka</snm><fnm>T</fnm></au></aug><source>Gene</source><pubdate>2005</pubdate><volume>355</volume><fpage>28</fpage><lpage>39</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.gene.2005.05.020</pubid><pubid idtype="pmpid" link="fulltext">16039805</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>Molecular mechanisms for the variation of mitochondrial gene content and gene arrangement among chigger mites of the genus Leptotrombidium (Acari: Acariformes)</p></title><aug><au><snm>Shao</snm><fnm>RF</fnm></au><au><snm>Barker</snm><fnm>SC</fnm></au><au><snm>Mitani</snm><fnm>H</fnm></au><au><snm>Takahashi</snm><fnm>M</fnm></au><au><snm>Fukunaga</snm><fnm>M</fnm></au></aug><source>Journal of Molecular Evolution</source><pubdate>2006</pubdate><volume>63</volume><issue>2</issue><fpage>251</fpage><lpage>261</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00239-005-0196-y</pubid><pubid idtype="pmpid" link="fulltext">16830100</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>The complete mitochondrial genome of the house dust mite Dermatophagoides pteronyssinus (Trouessart): a novel gene arrangement among arthropods</p></title><aug><au><snm>Dermauw</snm><fnm>W</fnm></au><au><snm>Van Leeuwen</snm><fnm>T</fnm></au><au><snm>Vanholme</snm><fnm>B</fnm></au><au><snm>Tirry</snm><fnm>L</fnm></au></aug><source>BMC Genomics</source><pubdate>2009</pubdate><volume>10</volume><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-10-107</pubid><pubid idtype="pmcid">2680895</pubid><pubid idtype="pmpid">19284646</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>Complete mitochondrial DNA sequence of the sea-firefly, Vargula hilgendorfii (Crustacea, Ostracoda) with duplicate control 
regions</p></title><aug><au><snm>Ogoh</snm><fnm>K</fnm></au><au><snm>Ohmiya</snm><fnm>Y</fnm></au></aug><source>Gene</source><pubdate>2004</pubdate><volume>327</volume><issue>1</issue><fpage>131</fpage><lpage>139</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.gene.2003.11.011</pubid><pubid idtype="pmpid" link="fulltext">14960368</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>Mitochondrial genome sequence and gene order of Sipunculus nudus give additional support for an inclusion of Sipuncula into Annelida</p></title><aug><au><snm>Mwinyi</snm><fnm>A</fnm></au><au><snm>Meyer</snm><fnm>A</fnm></au><au><snm>Bleidorn</snm><fnm>C</fnm></au><au><snm>Lieb</snm><fnm>B</fnm></au><au><snm>Bartolomaeus</snm><fnm>T</fnm></au><au><snm>Podsiadlowski</snm><fnm>L</fnm></au></aug><source>BMC Genomics</source><pubdate>2009</pubdate><volume>10</volume><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-10-27</pubid><pubid idtype="pmcid">2639372</pubid><pubid idtype="pmpid">19149868</pubid></pubidlist></xrefbib></bibl><bibl id="B39"><title><p>Complete mitochondrial genome of Bugula neritina (Bryozoa, Gymnolaemata, Cheilostomata): phylogenetic position of Bryozoa and phylogeny of lophophorates within the Lophotrochozoa</p></title><aug><au><snm>Jang</snm><fnm>KH</fnm></au><au><snm>Hwang</snm><fnm>UW</fnm></au></aug><source>BMC Genomics</source><pubdate>2009</pubdate><volume>10</volume><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-10-167</pubid><pubid idtype="pmcid">2678162</pubid><pubid idtype="pmpid">19379522</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>Complete sequence of the mitochondrial DNA of the primitive opisthobranch gastropod Pupa strigosa: systematic implication of the genome organization</p></title><aug><au><snm>Kurabayashi</snm><fnm>A</fnm></au><au><snm>Ueshima</snm><fnm>R</fnm></au></aug><source>Mol Biol 
Evol</source><pubdate>2000</pubdate><volume>17</volume><issue>2</issue><fpage>266</fpage><lpage>277</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">10677849</pubid></xrefbib></bibl><bibl id="B41"><title><p>Phylogenetic position of the Pentastomida and (pan)crustacean relationships</p></title><aug><au><snm>Lavrov</snm><fnm>DV</fnm></au><au><snm>Brown</snm><fnm>WM</fnm></au><au><snm>Boore</snm><fnm>JL</fnm></au></aug><source>P Roy Soc Lond B Bio</source><pubdate>2004</pubdate><volume>271</volume><issue>1538</issue><fpage>537</fpage><lpage>544</lpage><xrefbib><pubid idtype="doi">10.1098/rspb.2003.2631</pubid></xrefbib></bibl><bibl id="B42"><title><p>Insect mitochondrial control region: A review of its structure, evolution and usefulness in evolutionary studies</p></title><aug><au><snm>Zhang</snm><fnm>DX</fnm></au><au><snm>Hewitt</snm><fnm>GM</fnm></au></aug><source>Biochemical Systematics and Ecology</source><pubdate>1997</pubdate><volume>25</volume><issue>2</issue><fpage>99</fpage><lpage>120</lpage><xrefbib><pubid idtype="doi">10.1016/S0305-1978(96)00042-7</pubid></xrefbib></bibl><bibl id="B43"><title><p>The mitochondrial genome: structure, transcription, translation and replication</p></title><aug><au><snm>Taanman</snm><fnm>JW</fnm></au></aug><source>Biochimica Et Biophysica Acta-Bioenergetics</source><pubdate>1999</pubdate><volume>1410</volume><issue>2</issue><fpage>103</fpage><lpage>123</lpage><xrefbib><pubid idtype="doi">10.1016/S0005-2728(98)00161-3</pubid></xrefbib></bibl><bibl id="B44"><title><p>Replication origin of mitochondrial DNA in insects</p></title><aug><au><snm>Saito</snm><fnm>S</fnm></au><au><snm>Tamura</snm><fnm>K</fnm></au><au><snm>Aotsuka</snm><fnm>T</fnm></au></aug><source>Genetics</source><pubdate>2005</pubdate><volume>171</volume><issue>4</issue><fpage>1695</fpage><lpage>1705</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1534/genetics.105.046243</pubid><pubid idtype="pmcid">1456096</pubid><pubid 
idtype="pmpid">16118189</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>A Drosophila model of mitochondrial DNA replication: Proteins, genes and regulation</p></title><aug><au><snm>Garesse</snm><fnm>R</fnm></au><au><snm>Kaguni</snm><fnm>LS</fnm></au></aug><source>IUBMB Life</source><pubdate>2005</pubdate><volume>57</volume><issue>8</issue><fpage>555</fpage><lpage>561</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1080/15216540500215572</pubid><pubid idtype="pmpid" link="fulltext">16118113</pubid></pubidlist></xrefbib></bibl><bibl id="B46"><title><p>The complete mitochondrial genome of the demosponge Negombata magnifica (Poecilosclerida)</p></title><aug><au><snm>Belinky</snm><fnm>F</fnm></au><au><snm>Rot</snm><fnm>C</fnm></au><au><snm>Ilan</snm><fnm>M</fnm></au><au><snm>Huchon</snm><fnm>D</fnm></au></aug><source>Mol Phylogenet Evol</source><pubdate>2008</pubdate><volume>47</volume><issue>3</issue><fpage>1238</fpage><lpage>1243</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ympev.2007.12.004</pubid><pubid idtype="pmpid" link="fulltext">18203626</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>The complete mitochondrial DNA sequence of the horseshoe crab Limulus polyphemus</p></title><aug><au><snm>Lavrov</snm><fnm>DV</fnm></au><au><snm>Boore</snm><fnm>JL</fnm></au><au><snm>Brown</snm><fnm>WM</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>2000</pubdate><volume>17</volume><issue>5</issue><fpage>813</fpage><lpage>824</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">10779542</pubid></xrefbib></bibl><bibl id="B48"><title><p>Complete sequences of the highly rearranged molluscan mitochondrial genomes of the scaphopod Graptacme eborea and the bivalve Mytilus edulis</p></title><aug><au><snm>Boore</snm><fnm>JL</fnm></au><au><snm>Medina</snm><fnm>M</fnm></au><au><snm>Rosenberg</snm><fnm>LA</fnm></au></aug><source>Molecular Biology and 
Evolution</source><pubdate>2004</pubdate><volume>21</volume><issue>8</issue><fpage>1492</fpage><lpage>1503</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msh090</pubid><pubid idtype="pmpid" link="fulltext">15014161</pubid></pubidlist></xrefbib></bibl><bibl id="B49"><title><p>Complete mitochondrial DNA sequence of oyster Crassostrea hongkongensis-a case of "Tandem duplication-random loss" for genome rearrangement in Crassostrea?</p></title><aug><au><snm>Yu</snm><fnm>ZN</fnm></au><au><snm>Wei</snm><fnm>ZP</fnm></au><au><snm>Kong</snm><fnm>XY</fnm></au><au><snm>Shi</snm><fnm>W</fnm></au></aug><source>BMC Genomics</source><pubdate>2008</pubdate><volume>9</volume></bibl><bibl id="B50"><title><p>CREx: inferring genomic rearrangements based on common intervals</p></title><aug><au><snm>Bernt</snm><fnm>M</fnm></au><au><snm>Merkle</snm><fnm>D</fnm></au><au><snm>Ramsch</snm><fnm>K</fnm></au><au><snm>Fritzsch</snm><fnm>G</fnm></au><au><snm>Perseke</snm><fnm>M</fnm></au><au><snm>Bernhard</snm><fnm>D</fnm></au><au><snm>Schlegel</snm><fnm>M</fnm></au><au><snm>Stadler</snm><fnm>PF</fnm></au><au><snm>Middendorf</snm><fnm>M</fnm></au></aug><source>Bioinformatics</source><pubdate>2007</pubdate><volume>23</volume><issue>21</issue><fpage>2957</fpage><lpage>2958</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btm468</pubid><pubid idtype="pmpid" link="fulltext">17895271</pubid></pubidlist></xrefbib></bibl><bibl id="B51"><title><p>Complete mtDNA sequences of two millipedes suggest a new model for mitochondrial gene rearrangements: Duplication and nonrandom loss</p></title><aug><au><snm>Lavrov</snm><fnm>DV</fnm></au><au><snm>Boore</snm><fnm>JL</fnm></au><au><snm>Brown</snm><fnm>WM</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>2002</pubdate><volume>19</volume><issue>2</issue><fpage>163</fpage><lpage>169</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">11801744</pubid></xrefbib></bibl><bibl id="B52"><title><p>Tandem 
duplication of D-loop and ribosomal RNA sequences in lizard mitochondrial DNA</p></title><aug><au><snm>Moritz</snm><fnm>C</fnm></au><au><snm>Brown</snm><fnm>WM</fnm></au></aug><source>Science</source><pubdate>1986</pubdate><volume>233</volume><issue>4771</issue><fpage>1425</fpage><lpage>1427</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.3018925</pubid><pubid idtype="pmpid" link="fulltext">3018925</pubid></pubidlist></xrefbib></bibl><bibl id="B53"><title><p>Animal mitochondrial DNA recombination</p></title><aug><au><snm>Lunt</snm><fnm>DH</fnm></au><au><snm>Hyman</snm><fnm>BC</fnm></au></aug><source>Nature</source><pubdate>1997</pubdate><volume>387</volume><issue>6630</issue><fpage>247</fpage><lpage>247</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/387247a0</pubid><pubid idtype="pmpid" link="fulltext">9153388</pubid></pubidlist></xrefbib></bibl><bibl id="B54"><title><p>The highly rearranged mitochondrial genome of the plague thrips, Thrips imaginis (Insecta: Thysanoptera): Convergence of two novel gene boundaries and an extraordinary arrangement of rRNA genes</p></title><aug><au><snm>Shao</snm><fnm>RF</fnm></au><au><snm>Barker</snm><fnm>SC</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>2003</pubdate><volume>20</volume><issue>3</issue><fpage>362</fpage><lpage>370</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msg045</pubid><pubid idtype="pmpid" link="fulltext">12644556</pubid></pubidlist></xrefbib></bibl><bibl id="B55"><title><p>Recombination in animal mitochondrial DNA: Evidence from published sequences</p></title><aug><au><snm>Ladoukakis</snm><fnm>ED</fnm></au><au><snm>Zouros</snm><fnm>E</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>2001</pubdate><volume>18</volume><issue>11</issue><fpage>2127</fpage><lpage>2131</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">11606710</pubid></xrefbib></bibl><bibl id="B56"><title><p>The mitochondrial subgenomes of the nematode 
Globodera pallida are mosaics: Evidence of recombination in an animal mitochondrial genome</p></title><aug><au><snm>Gibson</snm><fnm>T</fnm></au><au><snm>Blok</snm><fnm>VC</fnm></au><au><snm>Phillips</snm><fnm>MS</fnm></au><au><snm>Hong</snm><fnm>G</fnm></au><au><snm>Kumarasinghe</snm><fnm>D</fnm></au><au><snm>Riley</snm><fnm>IT</fnm></au><au><snm>Dowton</snm><fnm>M</fnm></au></aug><source>Journal of Molecular Evolution</source><pubdate>2007</pubdate><volume>64</volume><issue>4</issue><fpage>463</fpage><lpage>471</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00239-006-0187-7</pubid><pubid idtype="pmpid" link="fulltext">17479345</pubid></pubidlist></xrefbib></bibl><bibl id="B57"><title><p>Sequencing complete mitochondrial and plastid genomes</p></title><aug><au><snm>Burger</snm><fnm>G</fnm></au><au><snm>Lavrov</snm><fnm>DV</fnm></au><au><snm>Forget</snm><fnm>L</fnm></au><au><snm>Lang</snm><fnm>BF</fnm></au></aug><source>Nature Protocols</source><pubdate>2007</pubdate><volume>2</volume><issue>3</issue><fpage>603</fpage><lpage>614</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nprot.2007.59</pubid><pubid idtype="pmpid" link="fulltext">17406621</pubid></pubidlist></xrefbib></bibl><bibl id="B58"><title><p>A multipartite mitochondrial genome in the potato cyst nematode Globodera pallida</p></title><aug><au><snm>Armstrong</snm><fnm>MR</fnm></au><au><snm>Blok</snm><fnm>VC</fnm></au><au><snm>Phillips</snm><fnm>MS</fnm></au></aug><source>Genetics</source><pubdate>2000</pubdate><volume>154</volume><issue>1</issue><fpage>181</fpage><lpage>192</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1460896</pubid><pubid idtype="pmpid">10628979</pubid></pubidlist></xrefbib></bibl><bibl id="B59"><title><p>Exceptionally high density of NUMTs in the honeybee genome</p></title><aug><au><snm>Pamilo</snm><fnm>P</fnm></au><au><snm>Viljakainen</snm><fnm>L</fnm></au><au><snm>Vihavainen</snm><fnm>A</fnm></au></aug><source>Molecular Biology and 
Evolution</source><pubdate>2007</pubdate><volume>24</volume><issue>6</issue><fpage>1340</fpage><lpage>1346</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msm055</pubid><pubid idtype="pmpid" link="fulltext">17383971</pubid></pubidlist></xrefbib></bibl><bibl id="B60"><title><p>NUPTs in sequenced eukaryotes and their genomic organization in relation to NUMTs</p></title><aug><au><snm>Richly</snm><fnm>E</fnm></au><au><snm>Leister</snm><fnm>D</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>2004</pubdate><volume>21</volume><issue>10</issue><fpage>1972</fpage><lpage>1980</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msh210</pubid><pubid idtype="pmpid" link="fulltext">15254258</pubid></pubidlist></xrefbib></bibl><bibl id="B61"><title><p>DNA Damage Promotes Jumping between Templates during Enzymatic Amplification</p></title><aug><au><snm>Paabo</snm><fnm>S</fnm></au><au><snm>Irwin</snm><fnm>DM</fnm></au><au><snm>Wilson</snm><fnm>AC</fnm></au></aug><source>Journal of Biological Chemistry</source><pubdate>1990</pubdate><volume>265</volume><issue>8</issue><fpage>4718</fpage><lpage>4721</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">2307682</pubid></xrefbib></bibl><bibl id="B62"><title><p>A broad survey of recombination in animal mitochondria</p></title><aug><au><snm>Piganeau</snm><fnm>G</fnm></au><au><snm>Gardner</snm><fnm>M</fnm></au><au><snm>Eyre-Walker</snm><fnm>A</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>2004</pubdate><volume>21</volume><issue>12</issue><fpage>2319</fpage><lpage>2325</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msh244</pubid><pubid idtype="pmpid" link="fulltext">15342796</pubid></pubidlist></xrefbib></bibl><bibl id="B63"><title><p>Recombination in animal mitochondrial DNA</p></title><aug><au><snm>Smith</snm><fnm>JM</fnm></au><au><snm>Smith</snm><fnm>NH</fnm></au></aug><source>Molecular Biology and 
Evolution</source><pubdate>2002</pubdate><volume>19</volume><issue>12</issue><fpage>2330</fpage><lpage>2332</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">12446825</pubid></xrefbib></bibl><bibl id="B64"><title><p>Ecdysozoan Mitogenomics: Evidence for a Common Origin of the Legged Invertebrates, the Panarthropoda</p></title><aug><au><snm>Rota-Stabelli</snm><fnm>O</fnm></au><au><snm>Kayal</snm><fnm>E</fnm></au><au><snm>Gleeson</snm><fnm>D</fnm></au><au><snm>Daub</snm><fnm>J</fnm></au><au><snm>Boore</snm><fnm>JL</fnm></au><au><snm>Telford</snm><fnm>MJ</fnm></au><au><snm>Pisani</snm><fnm>D</fnm></au><au><snm>Blaxter</snm><fnm>M</fnm></au><au><snm>Lavrov</snm><fnm>DV</fnm></au></aug><source>Genome Biol Evol</source><pubdate>2010</pubdate><volume>2</volume><fpage>425</fpage><lpage>440</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/gbe/evq030</pubid><pubid idtype="pmcid">2998192</pubid><pubid idtype="pmpid">20624745</pubid></pubidlist></xrefbib></bibl><bibl id="B65"><title><p>Arthropod phylogeny revisited, with a focus on crustacean relationships</p></title><aug><au><snm>Koenemann</snm><fnm>S</fnm></au><au><snm>Jenner</snm><fnm>RA</fnm></au><au><snm>Hoenemann</snm><fnm>M</fnm></au><au><snm>Stemme</snm><fnm>T</fnm></au><au><snm>von Reumont</snm><fnm>BM</fnm></au></aug><source>Arthropod Struct Dev</source><pubdate>2010</pubdate><volume>39</volume><issue>2-3</issue><fpage>88</fpage><lpage>110</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.asd.2009.10.003</pubid><pubid idtype="pmpid" link="fulltext">19854296</pubid></pubidlist></xrefbib></bibl><bibl id="B66"><title><p>Extraordinary host switching in siphonostomatoid copepods and the demise of the Monstrilloida: Integrating molecular data, ontogeny and antennulary 
morphology</p></title><aug><au><snm>Huys</snm><fnm>R</fnm></au><au><snm>Llewellyn-Hughes</snm><fnm>J</fnm></au><au><snm>Conroy-Dalton</snm><fnm>S</fnm></au><au><snm>Olson</snm><fnm>PD</fnm></au><au><snm>Spinks</snm><fnm>JN</fnm></au><au><snm>Johnston</snm><fnm>DA</fnm></au></aug><source>Mol Phylogenet Evol</source><pubdate>2007</pubdate><volume>43</volume><issue>2</issue><fpage>368</fpage><lpage>378</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ympev.2007.02.004</pubid><pubid idtype="pmpid" link="fulltext">17383905</pubid></pubidlist></xrefbib></bibl><bibl id="B67"><title><p>Small subunit rDNA and Bayesian inference reveal Pectenophilus ornatus (Copepoda incertae sedis) as highly transformed Mytilicolidae, and support assignment of Chondracanthidae and Xarifiidae to Lichomolgoidea (Cyclopoida)</p></title><aug><au><snm>Huys</snm><fnm>R</fnm></au><au><snm>Llewellyn-Hughes</snm><fnm>J</fnm></au><au><snm>Olson</snm><fnm>PD</fnm></au><au><snm>Nagasawa</snm><fnm>K</fnm></au></aug><source>Biol J Linn Soc</source><pubdate>2006</pubdate><volume>87</volume><issue>3</issue><fpage>403</fpage><lpage>425</lpage><xrefbib><pubid idtype="doi">10.1111/j.1095-8312.2005.00579.x</pubid></xrefbib></bibl><bibl id="B68"><title><p>Phylogenetic-Signal Dissection of Nuclear Housekeeping Genes Supports the Paraphyly of Sponges and the Monophyly of Eumetazoa</p></title><aug><au><snm>Sperling</snm><fnm>EA</fnm></au><au><snm>Peterson</snm><fnm>KJ</fnm></au><au><snm>Pisani</snm><fnm>D</fnm></au></aug><source>Molecular Biology and Evolution</source><pubdate>2009</pubdate><volume>26</volume><issue>10</issue><fpage>2261</fpage><lpage>2274</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msp148</pubid><pubid idtype="pmpid" link="fulltext">19597161</pubid></pubidlist></xrefbib></bibl><bibl id="B69"><title><p>Arthropod relationships revealed by phylogenomic analysis of nuclear protein-coding 
sequences</p></title><aug><au><snm>Regier</snm><fnm>JC</fnm></au><au><snm>Shultz</snm><fnm>JW</fnm></au><au><snm>Zwick</snm><fnm>A</fnm></au><au><snm>Hussey</snm><fnm>A</fnm></au><au><snm>Ball</snm><fnm>B</fnm></au><au><snm>Wetzer</snm><fnm>R</fnm></au><au><snm>Martin</snm><fnm>JW</fnm></au><au><snm>Cunningham</snm><fnm>CW</fnm></au></aug><source>Nature</source><pubdate>2010</pubdate><volume>463</volume><issue>7284</issue><fpage>1079</fpage><lpage>1083</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature08742</pubid><pubid idtype="pmpid" link="fulltext">20147900</pubid></pubidlist></xrefbib></bibl><bibl id="B70"><title><p>Arthropod phylogeny based on eight molecular loci and morphology</p></title><aug><au><snm>Giribet</snm><fnm>G</fnm></au><au><snm>Edgecombe</snm><fnm>GD</fnm></au><au><snm>Wheeler</snm><fnm>WC</fnm></au></aug><source>Nature</source><pubdate>2001</pubdate><volume>413</volume><issue>6852</issue><fpage>157</fpage><lpage>161</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/35093097</pubid><pubid idtype="pmpid" link="fulltext">11557979</pubid></pubidlist></xrefbib></bibl><bibl id="B71"><title><p>Gene translocation links insects and crustaceans</p></title><aug><au><snm>Boore</snm><fnm>JL</fnm></au><au><snm>Lavrov</snm><fnm>DV</fnm></au><au><snm>Brown</snm><fnm>WM</fnm></au></aug><source>Nature</source><pubdate>1998</pubdate><volume>392</volume><issue>6677</issue><fpage>667</fpage><lpage>668</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/33577</pubid><pubid idtype="pmpid" link="fulltext">9565028</pubid></pubidlist></xrefbib></bibl><bibl id="B72"><title><p>Phylogenetic analysis of mitochondrial protein coding genes confirms the reciprocal paraphyly of Hexapoda and Crustacea</p></title><aug><au><snm>Carapelli</snm><fnm>A</fnm></au><au><snm>Lio</snm><fnm>P</fnm></au><au><snm>Nardi</snm><fnm>F</fnm></au><au><snm>van der Wath</snm><fnm>E</fnm></au><au><snm>Frati</snm><fnm>F</fnm></au></aug><source>BMC evolutionary 
biology</source><pubdate>2007</pubdate><volume>7</volume><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2148-7-S2-S8</pubid><pubid idtype="pmcid">1963475</pubid><pubid idtype="pmpid">17767736</pubid></pubidlist></xrefbib></bibl><bibl id="B73"><title><p>Pancrustacean phylogeny: hexapods are terrestrial crustaceans and maxillopods are not monophyletic</p></title><aug><au><snm>Regier</snm><fnm>JC</fnm></au><au><snm>Shultz</snm><fnm>JW</fnm></au><au><snm>Kambic</snm><fnm>RE</fnm></au></aug><source>Proc Biol Sci</source><pubdate>2005</pubdate><volume>272</volume><issue>1561</issue><fpage>395</fpage><lpage>401</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1098/rspb.2004.2917</pubid><pubid idtype="pmcid">1634985</pubid><pubid idtype="pmpid">15734694</pubid></pubidlist></xrefbib></bibl><bibl id="B74"><title><p>DnaSP, DNA polymorphism analyses by the coalescent and other methods</p></title><aug><au><snm>Rozas</snm><fnm>J</fnm></au><au><snm>Sanchez-DelBarrio</snm><fnm>JC</fnm></au><au><snm>Messeguer</snm><fnm>X</fnm></au><au><snm>Rozas</snm><fnm>R</fnm></au></aug><source>Bioinformatics</source><pubdate>2003</pubdate><volume>19</volume><issue>18</issue><fpage>2496</fpage><lpage>2497</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btg359</pubid><pubid idtype="pmpid" link="fulltext">14668244</pubid></pubidlist></xrefbib></bibl><bibl id="B75"><title><p>Invertebrate species with nonpelagic larvae have elevated levels of nonsynonymous substitutions and reduced nucleotide diversities</p></title><aug><au><snm>Foltz</snm><fnm>DW</fnm></au></aug><source>Journal of Molecular Evolution</source><pubdate>2003</pubdate><volume>57</volume><issue>6</issue><fpage>607</fpage><lpage>612</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00239-003-2495-5</pubid><pubid idtype="pmpid" link="fulltext">14745529</pubid></pubidlist></xrefbib></bibl><bibl id="B76"><title><p>Base-calling of automated sequencer traces using phred. I. 
Accuracy assessment</p></title><aug><au><snm>Ewing</snm><fnm>B</fnm></au><au><snm>Hillier</snm><fnm>L</fnm></au><au><snm>Wendl</snm><fnm>MC</fnm></au><au><snm>Green</snm><fnm>P</fnm></au></aug><source>Genome Res</source><pubdate>1998</pubdate><volume>8</volume><issue>3</issue><fpage>175</fpage><lpage>185</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">9521921</pubid></xrefbib></bibl><bibl id="B77"><title><p>Base-calling of automated sequencer traces using phred. II. Error probabilities</p></title><aug><au><snm>Ewing</snm><fnm>B</fnm></au><au><snm>Green</snm><fnm>P</fnm></au></aug><source>Genome Res</source><pubdate>1998</pubdate><volume>8</volume><issue>3</issue><fpage>186</fpage><lpage>194</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">9521922</pubid></xrefbib></bibl><bibl id="B78"><title><p>Consed: A graphical tool for sequence finishing</p></title><aug><au><snm>Gordon</snm><fnm>D</fnm></au><au><snm>Abajian</snm><fnm>C</fnm></au><au><snm>Green</snm><fnm>P</fnm></au></aug><source>Genome Res</source><pubdate>1998</pubdate><volume>8</volume><issue>3</issue><fpage>195</fpage><lpage>202</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">9521923</pubid></xrefbib></bibl><bibl id="B79"><title><p>tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence</p></title><aug><au><snm>Lowe</snm><fnm>TM</fnm></au><au><snm>Eddy</snm><fnm>SR</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>1997</pubdate><volume>25</volume><issue>5</issue><fpage>955</fpage><lpage>964</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/25.5.955</pubid><pubid idtype="pmcid">146525</pubid><pubid idtype="pmpid">9023104</pubid></pubidlist></xrefbib></bibl><bibl id="B80"><title><p>ARWEN: a program to detect tRNA genes in metazoan mitochondrial nucleotide 
sequences</p></title><aug><au><snm>Laslett</snm><fnm>D</fnm></au><au><snm>Canback</snm><fnm>B</fnm></au></aug><source>Bioinformatics</source><pubdate>2008</pubdate><volume>24</volume><issue>2</issue><fpage>172</fpage><lpage>175</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btm573</pubid><pubid idtype="pmpid" link="fulltext">18033792</pubid></pubidlist></xrefbib></bibl><bibl id="B81"><title><p>DCSE, an interactive tool for sequence alignment and secondary structure research</p></title><aug><au><snm>De Rijk</snm><fnm>P</fnm></au><au><snm>De Wachter</snm><fnm>R</fnm></au></aug><source>Computer Applications in the Biosciences</source><pubdate>1993</pubdate><volume>9</volume><issue>6</issue><fpage>735</fpage><lpage>740</lpage><xrefbib><pubid idtype="pmpid">7511479</pubid></xrefbib></bibl><bibl id="B82"><title><p>RnaViz 2: an improved representation of RNA secondary structure</p></title><aug><au><snm>De Rijk</snm><fnm>P</fnm></au><au><snm>Wuyts</snm><fnm>J</fnm></au><au><snm>De Wachter</snm><fnm>R</fnm></au></aug><source>Bioinformatics</source><pubdate>2003</pubdate><volume>19</volume><issue>2</issue><fpage>299</fpage><lpage>300</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/19.2.299</pubid><pubid idtype="pmpid" link="fulltext">12538259</pubid></pubidlist></xrefbib></bibl><bibl id="B83"><title><p>Mfold web server for nucleic acid folding and hybridization prediction</p></title><aug><au><snm>Zuker</snm><fnm>M</fnm></au></aug><source>Nucleic Acids Research</source><pubdate>2003</pubdate><volume>31</volume><issue>13</issue><fpage>3406</fpage><lpage>3415</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkg595</pubid><pubid idtype="pmcid">169194</pubid><pubid idtype="pmpid">12824337</pubid></pubidlist></xrefbib></bibl><bibl id="B84"><title><p>PAML 4: Phylogenetic analysis by maximum likelihood</p></title><aug><au><snm>Yang</snm><fnm>ZH</fnm></au></aug><source>Molecular Biology and 
Evolution</source><pubdate>2007</pubdate><volume>24</volume><issue>8</issue><fpage>1586</fpage><lpage>1591</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msm088</pubid><pubid idtype="pmpid" link="fulltext">17483113</pubid></pubidlist></xrefbib></bibl><bibl id="B85"><title><p>Probalign: multiple sequence alignment using partition function posterior probabilities</p></title><aug><au><snm>Roshan</snm><fnm>U</fnm></au><au><snm>Livesay</snm><fnm>DR</fnm></au></aug><source>Bioinformatics</source><pubdate>2006</pubdate><volume>22</volume><issue>22</issue><fpage>2715</fpage><lpage>2721</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btl472</pubid><pubid idtype="pmpid" link="fulltext">16954142</pubid></pubidlist></xrefbib></bibl><bibl id="B86"><title><p>Archaea sister group of bacteria? Indications from tree reconstruction artefacts in ancient phylogenies</p></title><aug><au><snm>Brinkmann</snm><fnm>H</fnm></au><au><snm>Philippe</snm><fnm>H</fnm></au></aug><source>Mol Biol Evol</source><pubdate>1999</pubdate><volume>16</volume><issue>6</issue><fpage>817</fpage><lpage>825</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">10368959</pubid></xrefbib></bibl></refgrp>
</bm></art>