<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>1471-2164-13-S7-S5</ui><ji>1471-2164</ji><fm>
<dochead>Proceedings</dochead>
<bibl>
<title>
<p>Quantitative assessment of mitochondrial DNA copies from whole genome sequencing</p>
</title>
<aug>
<au id="A1"><snm>Chu</snm><fnm>Hsueh-Ting</fnm><insr iid="I1"/><insr iid="I2"/><email>chu@live.asia.edu.tw</email></au>
<au id="A2"><snm>Hsiao</snm><mi>WL</mi><fnm>William</fnm><insr iid="I3"/><insr iid="I4"/><email>william.hsiao@gmail.com</email></au>
<au id="A3"><snm>Tsao</snm><mi>TH</mi><fnm>Theresa</fnm><insr iid="I5"/><email>postergrey@gmail.com</email></au>
<au id="A4"><snm>Chang</snm><fnm>Ching-Mao</fnm><insr iid="I6"/><email>magicbjp@gmail.com</email></au>
<au id="A5"><snm>Liu</snm><fnm>Yen-Wenn</fnm><insr iid="I8"/><email>skywenn@gmail.com</email></au>
<au id="A6"><snm>Fan</snm><fnm>Chen-Chieh</fnm><insr iid="I5"/><email>fanent@gmail.com</email></au>
<au id="A7"><snm>Lin</snm><fnm>Han</fnm><insr iid="I5"/><email>hotdogee@gmail.com</email></au>
<au id="A8"><snm>Chang</snm><fnm>Hen-Hong</fnm><insr iid="I6"/><insr iid="I7"/><email>tcmchh@mail.cgu.edu.tw</email></au>
<au id="A9"><snm>Yeh</snm><fnm>Tze-Jung</fnm><insr iid="I9"/><email>judyyah@gmail.com</email></au>
<au id="A10"><snm>Chen</snm><fnm>Jen-Chih</fnm><insr iid="I9"/><email>jchchen@ntu.edu.tw</email></au>
<au id="A11"><snm>Huang</snm><fnm>Dun-Ming</fnm><insr iid="I2"/><email>chobitshuang@yahoo.com.tw</email></au>
<au id="A12"><snm>Chen</snm><fnm>Chaur-Chin</fnm><insr iid="I10"/><email>cchen@cs.nthu.edu.tw</email></au>
<au ca="yes" id="A13"><snm>Kao</snm><fnm>Cheng-Yan</fnm><insr iid="I5"/><email>cykao@csie.ntu.edu.tw</email></au>
</aug>
<insg>
<ins id="I1"><p>Department of Biomedical informatics, Asia University, Taichung 41354, Taiwan</p></ins>
<ins id="I2"><p>Department of Computer Science and Information Engineering, Asia University, Taichung 41354, Taiwan</p></ins>
<ins id="I3"><p>BCCDC Public Health Microbiology &amp; Reference Laboratory, Vancouver, BC, V5Z 4R4, Canada</p></ins>
<ins id="I4"><p>Department of Pathology and Laboratory Medicine, Vancouver, BC, V5Z 4R4, Canada</p></ins>
<ins id="I5"><p>Department of Computer Science and Information Engineering, National Taiwan University, Taipei 10617, Taiwan</p></ins>
<ins id="I6"><p>Graduate Institute of Clinical Medical Science, Chang Gung University, Taoyuan 33302, Taiwan</p></ins>
<ins id="I7"><p>Center for Traditional Chinese Medicine, Chang Gung Memorial Hospital at Taoyuan, Chang Gung Medical Foundation, Taoyuan 33302, Taiwan</p></ins>
<ins id="I8"><p>Institute of Food Science and Technology, National Taiwan University, Taipei 10617, Taiwan</p></ins>
<ins id="I9"><p>Institute of Biotechnology, National Taiwan University, Taipei 10617, Taiwan</p></ins>
<ins id="I10"><p>Department of Computer Science, National Tsing Hua University, Hsinchu 30013, Taiwan</p></ins>
</insg>
<source>BMC Genomics</source>


<supplement><title><p>Eleventh International Conference on Bioinformatics (InCoB2012): Computational Biology</p></title><editor>Shoba Ranganathan, Christian Sch&#246;nbach, Sissades Tongsima, Jonathan Chan and Tin Wee Tan</editor><sponsor><note>The articles in this supplement were supported by funding agencies as detailed in the Acknowledgement section of each article</note></sponsor><note>Proceedings</note></supplement><conference><title><p>Asia Pacific Bioinformatics Network (APBioNet) Eleventh International Conference on Bioinformatics (InCoB2012)</p></title><location>Bangkok, Thailand</location><date-range>3-5 October 2012</date-range><url>http://www.incob2012.org/</url></conference><issn>1471-2164</issn>
<pubdate>2012</pubdate>
<volume>13</volume>
<issue>Suppl 7</issue>
<fpage>S5</fpage>
<url>http://www.biomedcentral.com/1471-2164/13/S5/S5</url>
<xrefbib><pubidlist><pubid idtype="pmpid">23282223</pubid><pubid idtype="doi">10.1186/1471-2164-13-S7-S5</pubid></pubidlist></xrefbib>
</bibl>
<history><pub><date><day>13</day><month>12</month><year>2012</year></date></pub></history>
<cpyrt><year>2012</year><collab>Chu et al.; licensee BioMed Central Ltd.</collab><note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<sec>
<st>
<p>Background</p>
</st>
<p>Mitochondrial dysfunction is associated with various aging diseases. The copy number of mtDNA in human cells may therefore be a potential biomarker for diagnostics of aging. Here we propose a new computational method for the accurate assessment of mtDNA copies from whole genome sequencing data.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>Two families of the human whole genome sequencing datasets from the HapMap and the 1000 Genomes projects were used for the accurate counting of mitochondrial DNA copy numbers. The results revealed the parental mitochondrial DNA copy numbers are significantly lower than that of their children in these samples. There are 8%~21% more copies of mtDNA in samples from the children than from their parents. The experiment demonstrated the possible correlations between the quantity of mitochondrial DNA and aging-related diseases.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>Since the next-generation sequencing technology strives to deliver affordable and non-biased sequencing results, accurate assessment of mtDNA copy numbers can be achieved effectively from the output of whole genome sequencing. We implemented the method as a software package MitoCounter with the source code and user's guide available to the public at <url>http://sourceforge.net/projects/mitocounter/</url>.</p>
</sec>
</sec>
</abs>
</fm><bdy>
<sec>
<st>
<p>Background</p>
</st>
<p>Human mitochondria contain multiple copies of a 16.5 k bp, double-stranded, circular DNA molecule (Figure <figr fid="F1">1a</figr>). Since mitochondria are the organelles that generate chemical energy for cellular functions, many disease symptoms are linked to mitochondrial dysfunction, including poor growth, muscle weakness, hearing problems, visual problems, heart diseases, and liver diseases. There were many recent studies which showed significantly reduced mitochondrial DNA (mtDNA) copy numbers in cell samples of aging-related diseases <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B2">2</abbr>
<abbr bid="B3">3</abbr>
</abbrgrp>. A recent study also reported that mtDNA copy number is associated with cancer risk <abbrgrp>
<abbr bid="B4">4</abbr>
</abbrgrp>. Therefore, quantitative assessment of mtDNA in human cells can elucidate the relationship between mitochondrial diseases and mitochondrial dysfunction.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Overview of human whole genome sequencing</p></caption><text>
   <p><b>Overview of human whole genome sequencing</b>. <b>A) </b>The human genome is composed of nuclear DNA and mitochondrial DNA. The nuclear DNA is stored on 23 chromosome pairs and there are multiple copies of small DNA located in mitochondria. <b>B) </b>The reads from the sequencing of human whole genome are mixed with both nuclear DNA and mitochondrial DNA.</p>
</text><graphic file="1471-2164-13-S7-S5-1"/></fig>
<p>In the past decade, quantitative real-time PCR assays were developed to estimate relative levels of mtDNA copy numbers in samples <abbrgrp>
<abbr bid="B2">2</abbr>
<abbr bid="B5">5</abbr>
<abbr bid="B6">6</abbr>
</abbrgrp>. This approach measures the mtDNA copy number by determining the ratio of PCR amplicons to that of a single nuclear gene in experimental samples. The recent development of next-generation sequencing technology (NGS) revolutionized genomic studies and produced accurate whole genome sequencing (WGS) datasets <abbrgrp>
<abbr bid="B7">7</abbr>
</abbrgrp>. As shown in Figure <figr fid="F1">1b</figr>, the output from human whole genome sequencing consists of both nuclear DNA (nuDNA) and mitochondrial DNA (mtDNA) molecules, thus it is convenient to assess mtDNA copy number from WGS dataset and can be an alternative to real-time PCR assays.</p>
<p>Here we demonstrate a computational method for counting mtDNA copy number using WGS datasets. The three steps in the process are (1) typing of mtDNA, (2) separation of mtDNA reads, and (3) calculation of mtDNA count. We developed a freely available software package called MitoCounter for this purpose. MitoCounter can be used to calculate the average copy numbers of mtDNA molecules in the sequenced samples. Besides, the separated mtDNA reads provide further analysis of mtDNA heteroplasmy. The mtDNA heteroplasmy represents the mixture of individual mtDNA mutations. Heteroplasmy levels can alter the clinical penetrance of primary mtDNA diseases <abbrgrp>
<abbr bid="B8">8</abbr>
<abbr bid="B9">9</abbr>
</abbrgrp>.</p>
</sec>
<sec>
<st>
<p>Methods</p>
</st>
<sec>
<st>
<p>A computational assay for counting mtDNA copies from a WGS dataset</p>
</st>
<p>Since the library construction bias is minimized with the next-generation sequencing platform <abbrgrp>
<abbr bid="B10">10</abbr>
</abbrgrp>, both mitochondrial DNA (mtDNA) and nuclear DNA (nuDNA) are sequenced together with equal opportunities. The output dataset comprises a mixture of mtDNA reads and nuDNA reads. Let the total number of nucleotide bases in the nuclear genome be 2N (for diploid chromosomes) and the number of bases in a mitochondrial DNA is M. Then the summation of nucleotide bases in the entire human genome is 2N+kM, where k is the number of mtDNA copies. The numbers of reads from nuDNA and the number of reads from mtDNA should reflect the ratio of 2N:kM.</p>
<p>That is,</p>
<p>
<display-formula id="M1">
<m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2164-13-S7-S5-i1"><m:mrow>
   <m:mfrac>
      <m:mrow>
         <m:mi>m</m:mi>
         <m:mi>t</m:mi>
         <m:mi>B</m:mi>
         <m:mi>a</m:mi>
         <m:mi>s</m:mi>
         <m:mi>e</m:mi>
         <m:mi>s</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>a</m:mi>
         <m:mi>l</m:mi>
         <m:mi>l</m:mi>
         <m:mi>B</m:mi>
         <m:mi>a</m:mi>
         <m:mi>s</m:mi>
         <m:mi>e</m:mi>
         <m:mi>s</m:mi>
      </m:mrow>
   </m:mfrac>
   <m:mo class="MathClass-rel">=</m:mo>
   <m:mfrac>
      <m:mrow>
         <m:mi>k</m:mi>
         <m:mi>M</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mn>2</m:mn>
         <m:mi>N</m:mi>
         <m:mo class="MathClass-bin">+</m:mo>
         <m:mi>k</m:mi>
         <m:mi>M</m:mi>
      </m:mrow>
   </m:mfrac>
</m:mrow>
</m:math>
</display-formula>
</p>
<p>where <it>mtBases </it>is the total bases of sequenced reads from mtDNA and <it>allBases </it>is total bases of all sequenced reads from the output of a WGS procedure.</p>
<p>From an entire dataset of human whole genome sequencing, we separate the mtDNA reads from the others. Then the number of mtDNA copies can be approximated as</p>
<p>
<display-formula id="M2">
<m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2164-13-S7-S5-i2"><m:mrow>
   <m:mi>k</m:mi>
   <m:mo class="MathClass-rel">=</m:mo>
   <m:mfrac>
      <m:mrow>
         <m:mi>m</m:mi>
         <m:mi>t</m:mi>
         <m:mi>B</m:mi>
         <m:mi>a</m:mi>
         <m:mi>s</m:mi>
         <m:mi>e</m:mi>
         <m:mi>s</m:mi>
         <m:mo class="MathClass-bin">*</m:mo>
         <m:mn>2</m:mn>
         <m:mi>N</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mrow>
            <m:mo class="MathClass-open">(</m:mo>
            <m:mrow>
               <m:mi>a</m:mi>
               <m:mi>l</m:mi>
               <m:mi>l</m:mi>
               <m:mi>B</m:mi>
               <m:mi>a</m:mi>
               <m:mi>s</m:mi>
               <m:mi>e</m:mi>
               <m:mi>s</m:mi>
               <m:mo class="MathClass-bin">-</m:mo>
               <m:mi>m</m:mi>
               <m:mi>t</m:mi>
               <m:mi>B</m:mi>
               <m:mi>a</m:mi>
               <m:mi>s</m:mi>
               <m:mi>e</m:mi>
               <m:mi>s</m:mi>
            </m:mrow>
            <m:mo class="MathClass-close">)</m:mo>
         </m:mrow>
         <m:mo class="MathClass-bin">*</m:mo>
         <m:mi>M</m:mi>
      </m:mrow>
   </m:mfrac>
</m:mrow>
</m:math>
</display-formula>
</p>
<p>The equation for counting mtDNA copies is not suitable for plants (e.g. Arabidopsis) since their mtDNA sequences may contain segments of nuclear DNA. Besides, there are usually other DNA molecules in their cells, such as chloroplast genome and plasmid genome.</p>
</sec>
<sec>
<st>
<p>Software implementation</p>
</st>
<p>In order to precisely separate mtDNA reads from a WGS dataset, it is necessary to determine the genotype of the mitochondrial genome first. We designed a program WgsMitoAssembler to identify the homoplasmic sequences, which present the inherent mutations in most of mtDNA molecules. The program WgsMitoAssembler is a guided assembler, and it requires a reference mitochondrial sequence which is used to choose a beginning read and an ending read from the entire WGS dataset. We use the reference mtDNA sequence (GenBank: <ext-link ext-link-id="NC_001807.4" ext-link-type="gen">NC_001807.4</ext-link>) for the purpose. We then search for best candidate reads which can extend the beginning read from the 3' end to the 5' end until the ending read is met.</p>
<p>After the typing of the target mitochondrial genome, the homoplasmy sequence is used in the second program WgsMitoCounter. The program performs the job of separating mitochondrial reads from the entire WGS dataset. Considering that some of sequenced reads may contain erroneous bases, we design an error-tolerant mapping algorithm shown in Figure <figr fid="F2">2</figr>. We search for sub-sequences of paired reads which are indexed as mtDNA fragments and the accuracy of mapping is determined by the pairing distances. WgsMitoCounter will output a CSV file which records the number of mitochondrial reads in each run of the entire dataset. The template of final calculation for mtDNA copy number is provided in Additional file <supplr sid="S1">1</supplr>.</p>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>An error-tolerant mapping algorithm for filtering mitochondrial reads</p></caption><text>
   <p><b>An error-tolerant mapping algorithm for filtering mitochondrial reads</b>.</p>
</text><graphic file="1471-2164-13-S7-S5-2"/></fig>
<suppl id="S1">
<title>
<p>Additional File 1</p>
</title>
<text>
<p>
<b>Supplementary Software</b>. The MitoCounter software package consists of two execution programs, an Excel template and a User manual.</p>
</text>
<file name="1471-2164-13-S7-S5-S1.xls">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
</sec>
<sec>
<st>
<p>Results and discussion</p>
</st>
<sec>
<st>
<p>Parental mtDNA samples have less copy numbers</p>
</st>
<p>We apply the analysis to public WGS datasets from the HapMap <abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp> and 1000 Genomes <abbrgrp>
<abbr bid="B12">12</abbr>
</abbrgrp> projects. We chose six high-coverage WGS datasets for two pedigree trios: YOR009 and CEPH146 and two low-coverage WGS datasets for individual elders (Sample ID: NA11831 and NA06985), listed in Table <tblr tid="T1">1</tblr>. YOR009 is an African family. CEPH1463 is a family from Utah with Northern and Western European ancestry. The two individuals are also from the CEPH population and were recorded as the grandparents in the 1000 Genomes project. These DNA samples were isolated from B-lymphocyte cells derived from blood. Table <tblr tid="T2">2</tblr> lists the results of counting mitochondrial DNA on the selected datasets. The mtDNA counts for the YOR009 family are between 645~752 and for CEPH1463 family are between 734~950. Besides, the mtDNA counts for the two individual elders are 662 and 755.</p>
<tbl id="T1"><title><p>Table 1</p></title><caption><p>Typing of mtDNA from whole genome sequencing samples</p></caption><tblbdy cols="7">
      <r>
         <c ca="left">
            <p>
               <b>HapMap Family</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Sample</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Sex</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Relation</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>mtDNA</b>
            </p>
            <p>
               <b>Length</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Haplogroup*</b>
            </p>
            <p>
               <b>Reference</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Haplogroup</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="7">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>YOR009</p>
         </c>
         <c ca="left">
            <p>NA18507</p>
         </c>
         <c ca="left">
            <p>male</p>
         </c>
         <c ca="left">
            <p>father</p>
         </c>
         <c ca="left">
            <p>16567 bp</p>
         </c>
         <c ca="left">
            <p>AF346986</p>
         </c>
         <c ca="left">
            <p>L1b</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>NA18508</p>
         </c>
         <c ca="left">
            <p>female</p>
         </c>
         <c ca="left">
            <p>mother</p>
         </c>
         <c ca="left">
            <p>16567 bp</p>
         </c>
         <c ca="left">
            <p>DQ341073</p>
         </c>
         <c ca="left">
            <p>L3b</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>NA18506</p>
         </c>
         <c ca="left">
            <p>male</p>
         </c>
         <c ca="left">
            <p>child</p>
         </c>
         <c ca="left">
            <p>16567 bp</p>
         </c>
         <c ca="left">
            <p>DQ341073</p>
         </c>
         <c ca="left">
            <p>L3b</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CEPH146</p>
         </c>
         <c ca="left">
            <p>NA12891</p>
         </c>
         <c ca="left">
            <p>male</p>
         </c>
         <c ca="left">
            <p>father</p>
         </c>
         <c ca="left">
            <p>16572 bp</p>
         </c>
         <c ca="left">
            <p>EU715237</p>
         </c>
         <c ca="left">
            <p>H1</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>NA12892</p>
         </c>
         <c ca="left">
            <p>female</p>
         </c>
         <c ca="left">
            <p>mother</p>
         </c>
         <c ca="left">
            <p>16570 bp</p>
         </c>
         <c ca="left">
            <p>GU945543</p>
         </c>
         <c ca="left">
            <p>H13a1a1</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>NA12878</p>
         </c>
         <c ca="left">
            <p>female</p>
         </c>
         <c ca="left">
            <p>child</p>
         </c>
         <c ca="left">
            <p>16570 bp</p>
         </c>
         <c ca="left">
            <p>GU945543</p>
         </c>
         <c ca="left">
            <p>H13a1a1</p>
         </c>
      </r>
      <r>
         <c cspan="7">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CEPH1350</p>
         </c>
         <c ca="left">
            <p>NA11831</p>
         </c>
         <c ca="left">
            <p>male</p>
         </c>
         <c ca="left">
            <p>grandfather</p>
         </c>
         <c ca="left">
            <p>16569 bp</p>
         </c>
         <c ca="left">
            <p>AY495174</p>
         </c>
         <c ca="left">
            <p>H5</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CEPH1341</p>
         </c>
         <c ca="left">
            <p>NA06985</p>
         </c>
         <c ca="left">
            <p>female</p>
         </c>
         <c ca="left">
            <p>grandmother</p>
         </c>
         <c ca="left">
            <p>16569 bp</p>
         </c>
         <c ca="left">
            <p>AY882388</p>
         </c>
         <c ca="left">
            <p>U4b</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>* The references for haplogroup are selected from top BLAST results.</p>
   </tblfn></tbl>
<tbl id="T2"><title><p>Table 2</p></title><caption><p>Counts of mtDNA from whole genome sequencing samples</p></caption><tblbdy cols="8">
      <r>
         <c ca="left">
            <p>
               <b>Sample</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Relation</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>SRA ID</b>
            </p>
            <p>
               <b>of dataset</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>Runs</b>
            </p>
            <p>
               <b>in dataset</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>Total</b>
            </p>
            <p>
               <b>bases</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>mtDNA</b>
            </p>
            <p>
               <b>bases</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>mtDNA</b>
            </p>
            <p>
               <b>Ratio</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>mtDNA</b>
            </p>
            <p>
               <b>count</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="8">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>NA18507</p>
         </c>
         <c ca="left">
            <p>father</p>
         </c>
         <c ca="left">
            <p>ERX009609</p>
         </c>
         <c ca="right">
            <p>24</p>
         </c>
         <c ca="right">
            <p>135.2G</p>
         </c>
         <c ca="right">
            <p>249.8M</p>
         </c>
         <c ca="right">
            <p>0.185%</p>
         </c>
         <c ca="right">
            <p>646.84</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>NA18508</p>
         </c>
         <c ca="left">
            <p>mother</p>
         </c>
         <c ca="left">
            <p>ERX009610</p>
         </c>
         <c ca="right">
            <p>24</p>
         </c>
         <c ca="right">
            <p>133.2G</p>
         </c>
         <c ca="right">
            <p>239.7M</p>
         </c>
         <c ca="right">
            <p>0.180%</p>
         </c>
         <c ca="right">
            <p>629.69</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>NA18506</p>
         </c>
         <c ca="left">
            <p>child</p>
         </c>
         <c ca="left">
            <p>ERX009608</p>
         </c>
         <c ca="right">
            <p>24</p>
         </c>
         <c ca="right">
            <p>132.3G</p>
         </c>
         <c ca="right">
            <p>273.0M</p>
         </c>
         <c ca="right">
            <p>0.206%</p>
         </c>
         <c ca="right">
            <p>722.10</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>NA12891</p>
         </c>
         <c ca="left">
            <p>father</p>
         </c>
         <c ca="left">
            <p>ERX000172</p>
         </c>
         <c ca="right">
            <p>35</p>
         </c>
         <c ca="right">
            <p>1.538G</p>
         </c>
         <c ca="right">
            <p>36.78M</p>
         </c>
         <c ca="right">
            <p>0.239%</p>
         </c>
         <c ca="right">
            <p>837.11</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>NA12892</p>
         </c>
         <c ca="left">
            <p>mother</p>
         </c>
         <c ca="left">
            <p>ERX000174</p>
         </c>
         <c ca="right">
            <p>42</p>
         </c>
         <c ca="right">
            <p>1.543G</p>
         </c>
         <c ca="right">
            <p>31.58M</p>
         </c>
         <c ca="right">
            <p>0.205%</p>
         </c>
         <c ca="right">
            <p>716.25</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>NA12878</p>
         </c>
         <c ca="left">
            <p>child</p>
         </c>
         <c ca="left">
            <p>ERX000170</p>
         </c>
         <c ca="right">
            <p>55</p>
         </c>
         <c ca="right">
            <p>2.762G</p>
         </c>
         <c ca="right">
            <p>71.92M</p>
         </c>
         <c ca="right">
            <p>0.260%</p>
         </c>
         <c ca="right">
            <p>911.38</p>
         </c>
      </r>
      <r>
         <c cspan="8">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>NA11831</p>
         </c>
         <c ca="left">
            <p>grandfather</p>
         </c>
         <c ca="left">
            <p>SRX116265</p>
         </c>
         <c ca="right">
            <p>1</p>
         </c>
         <c ca="right">
            <p>4.15G</p>
         </c>
         <c ca="right">
            <p>7.54M</p>
         </c>
         <c ca="right">
            <p>0.182%</p>
         </c>
         <c ca="right">
            <p>662.74</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>NA06985</p>
         </c>
         <c ca="left">
            <p>grandmother</p>
         </c>
         <c ca="left">
            <p>SRX116266</p>
         </c>
         <c ca="right">
            <p>1</p>
         </c>
         <c ca="right">
            <p>11.97G</p>
         </c>
         <c ca="right">
            <p>24.80M</p>
         </c>
         <c ca="right">
            <p>0.207%</p>
         </c>
         <c ca="right">
            <p>755.92</p>
         </c>
      </r>
   </tblbdy></tbl>
<p>For the counting results of these WGS samples (Additional file <supplr sid="S2">2</supplr> and <supplr sid="S3">3</supplr>), ANOVA analysis revealed significant differences among the mtDNA counts within each family group: for YOR009, <it>F</it>(2,69) = 916.01, <it>p </it>= 2.06E-50 and for CEPH1463, <it>F</it>(2,169) = 58.75, <it>p </it>= 7.26632E-19. It showed that the offspring had 8%~23% more mtDNA than their parents in these samples. Although we did not investigate the possible artefacts caused by sequencing procedures, the results consistently demonstrated that there are more mtDNA sequences within younger persons' lymphocyte cells.</p>
<suppl id="S2">
<title>
<p>Additional File 2</p>
</title>
<text>
<p>
<b>Supplementary Table 1</b>. An Excel file lists the ratio of mitochondrial reads in each WGS run for the family trio YOR009 samples.</p>
</text>
<file name="1471-2164-13-S7-S5-S2.xls">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S3">
<title>
<p>Additional File 3</p>
</title>
<text>
<p>
<b>Supplementary Table 2</b>. An Excel file lists the ratio of mitochondrial reads in each WGS run for the family trio CEPH1463 samples.</p>
</text>
<file name="1471-2164-13-S7-S5-S3.zip">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>Many studies suggested that mitochondrial functions become defective as we age. Recent findings suggests that structural changes in mitochondria, including increased mitochondrial fragmentation and decreased mitochondrial fusion, are critical factors associated with mitochondrial dysfunction and cell death in aging and neurodegenerative diseases <abbrgrp>
<abbr bid="B13">13</abbr>
<abbr bid="B14">14</abbr>
</abbrgrp>. Therefore, the proposed quantitative analysis of mtDNA can help to further elucidate the dynamics of mitochondrial diseases. It is expected that cost for sequencing personal whole genome will be less than $1000 in the near future. For the purpose of counting mitochondrial DNA, it only requires a low coverage of the whole genome and the cost may be further reduced to $50. The cost-effectiveness of the procedure makes the proposed method of counting mitochondrial DNA as a useful diagnostic tool to study aging and aging-related diseases for individuals.</p>
</sec>
<sec>
<st>
<p>Availability and requirements</p>
</st>
<p>In the MitoCounter software package, both the programs WgsMitoAssembler and WgsMitoCounter were implemented in C# with the .NET Framework which can be run on 64-bit Windows. The program WgsMitoCounter requires paired-end WGS datasets from Illumina sequencing platform. The MitoCounter software with a user manual is available at the Web site: <url>http://sourceforge.net/projects/mitocounter/</url>
</p>
</sec>
<sec>
<st>
<p>Competing interests</p>
</st>
<p>The authors declare that they have no competing interests.</p>
</sec>
<sec>
<st>
<p>Authors' contributions</p>
</st>
<p>HTC devised the method and wrote the software. HTC, WLH, TTT, CMC, YWL, CCF, HHC, HL, TJY, JCC, DMH and CCC discussed the project and jointly wrote the manuscript. CYK conceived the project.</p>
</sec>
</bdy><bm>
<ack>
<sec>
<st>
<p>Acknowledgements</p>
</st>
<p>We thank the members in the Bioinformatics Lab, NTU, for valuable discussions and useful insights. We thank C. S. Chiou, C. H. Chan and C. F. Chang for comments and discussion. We thank Flora Kao for editing and proofreading.</p>
<p>This article has been published as part of <it>BMC Genomics </it>Volume 13 Supplement 7, 2012: Eleventh International Conference on Bioinformatics (InCoB2012): Computational Biology. The full contents of the supplement are available online at <url>http://www.biomedcentral.com/bmcgenomics/supplements/13/S7</url>.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>Mitochondrial diabetes: molecular mechanisms and clinical presentation</p></title><aug><au><snm>Maassen</snm><fnm>JA</fnm></au><au><snm>LM</snm><fnm>TH</fnm></au><au><snm>Van Essen</snm><fnm>E</fnm></au><au><snm>Heine</snm><fnm>RJ</fnm></au><au><snm>Nijpels</snm><fnm>G</fnm></au><au><snm>Jahangir Tafrechi</snm><fnm>RS</fnm></au><au><snm>Raap</snm><fnm>AK</fnm></au><au><snm>Janssen</snm><fnm>GM</fnm></au><au><snm>Lemkes</snm><fnm>HH</fnm></au></aug><source>Diabetes</source><pubdate>2004</pubdate><volume>53</volume><issue>Suppl 1</issue><fpage>S103</fpage><lpage>109</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">14749274</pubid></xrefbib></bibl><bibl id="B2"><title><p>Mitochondrial DNA copy number and function decrease with age in the short-lived fish Nothobranchius furzeri</p></title><aug><au><snm>Hartmann</snm><fnm>N</fnm></au><au><snm>Reichwald</snm><fnm>K</fnm></au><au><snm>Wittig</snm><fnm>I</fnm></au><au><snm>Drose</snm><fnm>S</fnm></au><au><snm>Schmeisser</snm><fnm>S</fnm></au><au><snm>Luck</snm><fnm>C</fnm></au><au><snm>Hahn</snm><fnm>C</fnm></au><au><snm>Graf</snm><fnm>M</fnm></au><au><snm>Gausmann</snm><fnm>U</fnm></au><au><snm>Terzibasi</snm><fnm>E</fnm></au><etal/></aug><source>Aging Cell</source><pubdate>2011</pubdate><volume>10</volume><issue>5</issue><fpage>824</fpage><lpage>831</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1474-9726.2011.00723.x</pubid><pubid idtype="pmpid" link="fulltext">21624037</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Age-related decline in mitochondrial DNA copy number in isolated human pancreatic islets</p></title><aug><au><snm>Cree</snm><fnm>LM</fnm></au><au><snm>Patel</snm><fnm>SK</fnm></au><au><snm>Pyle</snm><fnm>A</fnm></au><au><snm>Lynn</snm><fnm>S</fnm></au><au><snm>Turnbull</snm><fnm>DM</fnm></au><au><snm>Chinnery</snm><fnm>PF</fnm></au><au><snm>Walker</snm><fnm>M</fnm></au></aug><source>Diabetologia</source><pubdate>2008</pubdate><volume>51</volume><issue>8</issue><fpage>1440</fpage><lpage>1443</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00125-008-1054-4</pubid><pubid idtype="pmpid" link="fulltext">18528676</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Mitochondrial copy number is associated with colorectal cancer risk</p></title><aug><au><snm>Thyagarajan</snm><fnm>B</fnm></au><au><snm>Wang</snm><fnm>R</fnm></au><au><snm>Barcelo</snm><fnm>H</fnm></au><au><snm>Koh</snm><fnm>WP</fnm></au><au><snm>Yuan</snm><fnm>JM</fnm></au></aug><source>Cancer Epidemiol Biomarkers Prev</source><pubdate>2012</pubdate></bibl><bibl id="B5"><title><p>An alternative PCR assay for quantifying mitochondrial DNA in crude preparations</p></title><aug><au><snm>Koekemoer</snm><fnm>TC</fnm></au><au><snm>Downing</snm><fnm>TG</fnm></au><au><snm>Oelofsen</snm><fnm>W</fnm></au></aug><source>Nucleic Acids Research</source><pubdate>1998</pubdate><volume>26</volume><issue>11</issue><fpage>2829</fpage><lpage>2830</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/26.11.2829</pubid><pubid idtype="pmcid">147619</pubid><pubid idtype="pmpid" link="fulltext">9592176</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Effects of aging on mitochondrial DNA copy number and cytochrome c oxidase gene expression in rat skeletal muscle, liver, and heart</p></title><aug><au><snm>Barazzoni</snm><fnm>R</fnm></au><au><snm>Short</snm><fnm>KR</fnm></au><au><snm>Nair</snm><fnm>KS</fnm></au></aug><source>J Biol Chem</source><pubdate>2000</pubdate><volume>275</volume><issue>5</issue><fpage>3343</fpage><lpage>3347</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1074/jbc.275.5.3343</pubid><pubid idtype="pmpid" link="fulltext">10652323</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Accurate whole human genome sequencing using reversible terminator chemistry</p></title><aug><au><snm>Bentley</snm><fnm>DR</fnm></au><au><snm>Balasubramanian</snm><fnm>S</fnm></au><au><snm>Swerdlow</snm><fnm>HP</fnm></au><au><snm>Smith</snm><fnm>GP</fnm></au><au><snm>Milton</snm><fnm>J</fnm></au><au><snm>Brown</snm><fnm>CG</fnm></au><au><snm>Hall</snm><fnm>KP</fnm></au><au><snm>Evers</snm><fnm>DJ</fnm></au><au><snm>Barnes</snm><fnm>CL</fnm></au><au><snm>Bignell</snm><fnm>HR</fnm></au><etal/></aug><source>Nature</source><pubdate>2008</pubdate><volume>456</volume><issue>7218</issue><fpage>53</fpage><lpage>59</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature07517</pubid><pubid idtype="pmcid">2581791,2581791</pubid><pubid idtype="pmpid" link="fulltext">18987734</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Neutral mitochondrial heteroplasmy and the influence of aging</p></title><aug><au><snm>Sondheimer</snm><fnm>N</fnm></au><au><snm>Glatz</snm><fnm>CE</fnm></au><au><snm>Tirone</snm><fnm>JE</fnm></au><au><snm>Deardorff</snm><fnm>MA</fnm></au><au><snm>Krieger</snm><fnm>AM</fnm></au><au><snm>Hakonarson</snm><fnm>H</fnm></au></aug><source>Hum Mol Genet</source><pubdate>2011</pubdate><volume>20</volume><issue>8</issue><fpage>1653</fpage><lpage>1659</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/hmg/ddr043</pubid><pubid idtype="pmcid">3063991</pubid><pubid idtype="pmpid" link="fulltext">21296868</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Mitochondrial DNA heteroplasmy in diabetes and normal adults: role of acquired and inherited mutational patterns in twins</p></title><aug><au><snm>Avital</snm><fnm>G</fnm></au><au><snm>Buchshtav</snm><fnm>M</fnm></au><au><snm>Zhidkov</snm><fnm>I</fnm></au><au><snm>Tuval Feder</snm><fnm>J</fnm></au><au><snm>Dadon</snm><fnm>S</fnm></au><au><snm>Rubin</snm><fnm>E</fnm></au><au><snm>Glass</snm><fnm>D</fnm></au><au><snm>Spector</snm><fnm>TD</fnm></au><au><snm>Mishmar</snm><fnm>D</fnm></au></aug><source>Hum Mol Genet</source><pubdate>2012</pubdate></bibl><bibl id="B10"><title><p>Analyzing and minimizing PCR amplification bias in Illumina sequencing libraries</p></title><aug><au><snm>Aird</snm><fnm>D</fnm></au><au><snm>Ross</snm><fnm>MG</fnm></au><au><snm>Chen</snm><fnm>WS</fnm></au><au><snm>Danielsson</snm><fnm>M</fnm></au><au><snm>Fennell</snm><fnm>T</fnm></au><au><snm>Russ</snm><fnm>C</fnm></au><au><snm>Jaffe</snm><fnm>DB</fnm></au><au><snm>Nusbaum</snm><fnm>C</fnm></au><au><snm>Gnirke</snm><fnm>A</fnm></au></aug><source>Genome Biol</source><pubdate>2011</pubdate><volume>12</volume><issue>2</issue><fpage>R18</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2011-12-2-r18</pubid><pubid idtype="pmcid">3188800</pubid><pubid idtype="pmpid" link="fulltext">21338519</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>A haplotype map of the human genome</p></title><aug><au><snm>Consortium</snm><fnm>TIHGS</fnm></au></aug><source>Nature</source><pubdate>2005</pubdate><volume>437</volume><issue>7063</issue><fpage>1299</fpage><lpage>1320</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature04226</pubid><pubid idtype="pmcid">1880871</pubid><pubid idtype="pmpid" link="fulltext">16255080</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>A map of human genome variation from population-scale sequencing</p></title><aug><au><snm>Durbin</snm><fnm>RM</fnm></au><au><snm>Abecasis</snm><fnm>GR</fnm></au><au><snm>Altshuler</snm><fnm>DL</fnm></au><au><snm>Auton</snm><fnm>A</fnm></au><au><snm>Brooks</snm><fnm>LD</fnm></au><au><snm>Gibbs</snm><fnm>RA</fnm></au><au><snm>Hurles</snm><fnm>ME</fnm></au><au><snm>McVean</snm><fnm>GA</fnm></au></aug><source>Nature</source><pubdate>2010</pubdate><volume>467</volume><issue>7319</issue><fpage>1061</fpage><lpage>1073</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature09534</pubid><pubid idtype="pmcid">3042601</pubid><pubid idtype="pmpid" link="fulltext">20981092</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>Mitochondria take center stage in aging and neurodegeneration</p></title><aug><au><snm>Beal</snm><fnm>MF</fnm></au></aug><source>Ann Neurol</source><pubdate>2005</pubdate><volume>58</volume><issue>4</issue><fpage>495</fpage><lpage>505</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/ana.20624</pubid><pubid idtype="pmpid" link="fulltext">16178023</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>Mitochondria as a therapeutic target for aging and neurodegenerative diseases</p></title><aug><au><snm>Reddy</snm><fnm>PH</fnm></au><au><snm>Reddy</snm><fnm>TP</fnm></au></aug><source>Curr Alzheimer Res</source><pubdate>2011</pubdate><volume>8</volume><issue>4</issue><fpage>393</fpage><lpage>409</lpage><xrefbib><pubidlist><pubid idtype="doi">10.2174/156720511795745401</pubid><pubid idtype="pmcid">3295247</pubid><pubid idtype="pmpid" link="fulltext">21470101</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm></art>