Systematic genome sequence differences among leaf cells within individual trees

Diwan, Deepti; Komazaki, Shun; Suzuki, Miho; Nemoto, Naoto; Aita, Takuyo; Satake, Akiko; Nishigaki, Koichi

doi:10.1186/1471-2164-15-142

Methodology article
Open access
Published: 19 February 2014

Systematic genome sequence differences among leaf cells within individual trees

Deepti Diwan¹,
Shun Komazaki¹,
Miho Suzuki¹,
Naoto Nemoto¹,
Takuyo Aita³,
Akiko Satake² &
…
Koichi Nishigaki¹

BMC Genomics volume 15, Article number: 142 (2014) Cite this article

3260 Accesses
10 Citations
5 Altmetric
Metrics details

Abstract

Background

Even in the age of next-generation sequencing (NGS), it has been unclear whether or not cells within a single organism have systematically distinctive genomes. Resolving this question, one of the most basic biological problems associated with DNA mutation rates, can assist efforts to elucidate essential mechanisms of cancer.

Results

Using genome profiling (GP), we detected considerable systematic variation in genome sequences among cells in individual woody plants. The degree of genome sequence difference (genomic distance) varied systematically from the bottom to the top of the plant, such that the greatest divergence was observed between leaf genomes from uppermost branches and the remainder of the tree. This systematic variation was observed within both Yoshino cherry and Japanese beech trees.

Conclusions

As measured by GP, the genomic distance between two cells within an individual organism was non-negligible, and was correlated with physical distance (i.e., branch-to-branch distance). This phenomenon was assumed to be the result of accumulation of mutations from each cell division, implying that the degree of divergence is proportional to the number of generations separating the two cells.

Background

At the beginning of the 21st century, genome sequences of two closely related species, human and chimpanzee, were found to differ by approximately 4% based on conventional genome sequencing technology [1]. With the advent of next-generation sequencing (NGS), it has been established that each person has a unique genome [2]. Within a single organism, genome sequences may be epigenetically different between cells, and sporadic differences are sometimes present between cells from different organs [3]. It is not clear, however, whether each cell within an individual organism possesses a systematically different genome sequence.

Various breakthroughs have been steadily reshaping our understanding of genomes. These advances include accumulating analyses of whole-genome sequences of individuals [4, 5], identification of various non-coding RNAs [6], discovery of the existence of highly repeated sequences [7], and recognition of frequent recombination of genome structures [8, 9]. Recently, an intensive study on the fate of cancerous cells by NGS revealed that lineages of such cells are vigorously mutating [10]. Advanced papers on this topic have subsequently appeared [11, 12].

On the other hand, genome sequence differences have been examined by the copy number variation analysis between normal cells within a single organism [13–15], which informed us of frequent occurrence of mutation in the form of replication slippage at particular genomic loci. In a sense, this is a filtered (i.e., restricted to the tandem repeat sequences) observation of genome alterations. More wide observation of normal genomic DNA is just beginning as can be seen in the recent report [3, 16]. Our study is the first to detect systematic genome sequence differences among cells in single organisms, i.e., within individuals of two woody plant species (Figure 1E).

There has been a hypothesis (genetic mosaicism hypothesis) that long statured plants accumulate spontaneous mutations that expanded among modules (shoots, branches, leaf etc.) and become genetically mosaic as they grow [17]. This hypothesis is explicitly based on the idea of finite spontaneous mutation rate. That is, DNA replication proceeds with limited accuracy, i.e., 10^-6 to 10^-9 errors/base/replication [18] and thus every replicated genome sequence (e.g., the 3 × 10⁹-bp sequence of the human genome) naturally differs from its parental genome. In general, these differences were too small to be directly detected, as they were often below the detection limit of sequencing analysis. Consequently, mutation rate has been conventionally estimated indirectly based on phenotypic changes, such as variation in antibiotic resistance. This situation has been changed by the advent of the NGS (next generation sequencing), enabling the detection of low rate of mutations [19]. However, its application is limited mainly due to high cost and difficulty in data processing [20].

Fortuitously, Genome Profiling (GP) (Figure 2), an easily operable and informative genome analysis method [21–28] is sufficiently competent to detect differences between closely related cells [27, 28]. Compared with conventional sequencing approaches, GP involves two unique procedures (Figure 2): i) collection of DNA fragments from genomic DNA by random PCR [29] and ii) acquisition of DNA sequence information using micro-temperature gradient gel electrophoresis (μTGGE) by separating DNA fragments and observing their melting profiles (Figure 2B) [30–32]. In this method, the property, spiddos (species identification dots), derived from the DNA sequence information [22] plays the pivotal role in identifying a genome and enables us to measure the genome distance (see Methods).

GP has been used as a tool for universal species identification [21, 24, 27, 28, 33] and as an accurate detector of mutation [34, 35]. In this study, we applied the GP method to a new challenge: detection of extremely small genomic differences between very closely related cells with the aim of examining within-organism sequence variation.

Results and discussion

We used Japanese beech (Fagus crenata) trees to examine whether GP was able to reveal if all leaves within a single tree had identical genome sequences (Figures 2 and 3). More specifically, we analyzed sets of sp ecies id entification do ts (spiddos), a pivotal GP parameter derived from genome sequences (Figure 2C), that were obtained from genome profiles, specified by both mobility and melting temperature, both of which are determined after calibration and normalization of band patterns by a computer using co-migrating internal references (see Methods). Although genome profiles (i.e., DNA patterns generated by μTGGE analysis) were not always reproducible because of experimental fluctuations (i.e., environmental temperature, instrumental drift and others), spiddos were highly reproducible as a result of a normalization process that compensated for experimental fluctuations (Figure 2). As shown in Figure 4A, all leaves on the same Japanese beech branch (e.g., A1-1, A1-2, and A1-3, where “A1-2” refers to tree A, branch 1, leaf 2) clustered together. This was also the case for the genome profiles of leaves on branches A2 and A3. Leaves from different branches were found to have different genome sequences. Spiddos of branch A1 and A2 leaves were more similar to one another than to spiddos of leaves on branch A3, located furthest from the ground (Figure 3). Differences were observed in spiddos between leaves belonging to the same branch, but these differences were the level of experimental errors and thus they cannot be said to be significant at this moment [22]. These results reveal that within statistical significance, leaves from individual branches possessed identical genome sequences, but had distinctively different sequences from those of different branches, a finding not previously reported. This result was further confirmed by conducting a similar experiment using different Japanese beech individuals. We also analyzed another species, Yoshino cherry (Prunus × yedoensis), located ~800 km from the site of the Japanese beech trees for more generalized confirmation (Figure 1). Finally, to detect methodological differences, we sequenced a particular DNA band obtained from GP (see in Additional file 1: Figure S1 and for details see Additional file 2). Throughout these experiments, we consistently reached the same conclusion: genome sequences within organisms were not identical, but instead varied systematically.

Figure 4B reveals that very similar results were obtained from the two additional Japanese beech trees. Interestingly, the same relationship trend was observed among all three trees: spiddos of leaves from uppermost branches (A3, B3, and C3) were distinct from spiddos of other leaves (Figure 4B). The cluster dendrogram in Figure 4B was globally constructed based on the whole set of distances (d_G) obtained from all leaf spiddos (Additional file 3: Table S1); consequently, the resulting logically expected structure—leaves on the same branch grouped together and branches on the same tree clustered together—is most impressive and unexpected, demonstrating the effectiveness of this approach. It is therefore evident that genomes of leaves on a tree are neither completely identical to one another nor randomly different but, rather, systematically differ depending on branch location.

As shown in Figure 1, similar results were reproducibly obtained using the other species, Yoshino cherry. Results of cluster analyses of distances (d_G) obtained using spiddos data from three independent GP experiments using the same samples from five branches (Additional file 4: Table S2) are shown in Figure 1A-C; clustering results based on an average of the three trials are shown in Figure 1D. These results of individual experiments (Figure 1A,C) show basically the same pattern as those obtained from the statistically more reliable averages (Figure 1D), indicating that this experimental system has a rather low variance (in other words, a single experiment can provide a good prospect) with only a minor exception: positional exchange of branches 3 and 4 in Figure 1C. The situation observed in Figure 4 (Japanese beech) also held true for Yoshino cherry, i.e., genome profiles of leaves were not identical, but instead differed systematically. In addition, genomes of leaves from the uppermost branch (5–1, 5–2, and 5–3) were genetically distant from leaves of middle branches, indicating a correlation between genomic distance and branch location. The same phenomenon was thus observed in two different, widely separated species, namely, that leaves from the same tree have different genome sequences that can be distinguished using GP.

Our discovery was partially corroborated upon further investigation using direct sequencing. As shown in (see Additional file 1: Figure S1), leaves from the same branch tended to have more closely related sequences, as seen in pairs of leaves from the same Japanese beech branches (B2-2 and B2-3) and (B3-1 and B3-2) in (see Additional file 1: Figure S1B) and from closely located branches of Yoshino cherry (B2-1 and B3-1) and (B4-1 and B5-1) in (see Additional file 1: Figure S1A). Because of missing data caused by generation of artifacts during cloning and sequencing, these results are somewhat equivocal; nonetheless, these data are congruent with the conclusions drawn from the GP experiments. With respect to these direct sequencing results, the experimental procedures used, and sequencing in general, need to be taken into account. DNA fragments generated from the GP experiment were collected by excising their bands from polyacrylamide gels, the most reliable method for obtaining sequences common to both GP and conventional sequencing. Collected DNA was then subjected to cloning and sequencing, two procedures that can introduce mutations. Many spurious sequences were in fact obtained and discarded, including sequences having very low sequence similarity to the primary sequence generated from the DNA band, and sequences of non-plant origin. Although they were within an apparently acceptable range based on sequence consistency (i.e., high similarity), the results shown in (see Additional file 1: Figure S1) were thus subject to limitations inherent to the cloning and sequencing process. Nevertheless, this illustrates one difficulty encountered when using such a clone-isolation- and sequencing-based approach to identify mutation frequencies: the two mutation types—original mutations and sequencing operation-derived mutations (presumably introduced during template preparation, PCR-amplification, sequencing, and base-calling), cannot be distinguished in the final clonal sequencing results. To obtain statistically significant results using conventional high-precision sequencing, high-volume sequencing of the multiple-million base-pair level must be carried out to separate infrequently occurring mutations (e.g., < 10^-6/mutations/base/replication) from background noise. In this regard, it should be noted that the ability of the GP method to overcome this difficulty has been experimentally demonstrated: GP has been used successfully for species identification and classification [24, 25, 27, 28, 36] and in high-sensitivity mutation assays [34, 35].

In this study, we have demonstrated that leaves from the same tree do not have exactly identical genome sequences. This conclusion is expected to be applicable to any multi-celled organism, as DNA is not perfectly replicated in any organism, and thus each genome replication cycle induces mutations that are usually too infrequent to be detected (10^-6 to 10^-9 mutations/base/replication) [18]. In addition, epigenetic methylation of DNA, of which degree must be different from cell to cell and may have a potential to induce base-substitution during PCR, does not effect its PCR amplification [37], which was independently confirmed in our study (Table 1 and Figure 5). Based on the total number of base pairs in the DNA bands obtained by random PCR (i.e., roughly 10 bands, each 1000 bp), we tentatively estimate the GP method has a detection sensitivity of 10^-4 mutations/base/replication. More specifically, the total number of mutations accumulating over g generations, μ(g), can be calculated using the formula:

μ (g) = \sum_{i = 1}^{g} (μ (i) + γ (i)),

(1)

where μ(i) and γ(i) represent replication-dependent and repair-dependent mutation rates, respectively. If we tentatively assume μ(i) = μ_c (a constant) and μ(i) > > γ(i) for all i, then

μ (g) = g . μ_{c}

(2)

Table 1 Reagents used for the DNA methylation and restriction enzyme cleavage

Full size table

This estimate indicates that the GP method cannot detect mutations occurring at a frequency lower than g · μ_c (≤ 10^-4/base). Consequently, leaf genomes must contain a significant number of mutations, equivalent to the sum of replication- and repair-caused mutations. This finding leads us to consider whether the large number of estimated mutations implies that mutation events during replication and repair (a type of 'somatic mutation’) have been unexpectedly frequent [39], or if instead there is a large cell generation difference between tree branches, as follows:

If we assume that μ_c = 10^-8 in the above context, then g, the number of generations, must be

g = \frac{μ (g)}{μ_{c}} \underline{\underline{>}} \frac{10^{- 4}}{10^{- 8}} = 10^{4}

(3)

Because longitudinally tandem consecutive cells expand to the length g' · a, where g' is the number of cell generations and a is the unit cell length, we can calculate the number of cell generations (g') separating two branches. If a = 20 μm and the branch-to-branch distance, B, is 2 m, then

g^{'} = \frac{B}{a} = \frac{2}{(2 \times 10^{- 5})} = 10^{5} > g (= 10^{4})

(4)

and thus from Eq. 2,

μ_{c} = \frac{μ (g)}{g} > \frac{μ (g)}{g^{'}} = \frac{10^{- 4}}{10^{5}} = 10^{- 9}

(5)

Based on this tentative calculation, the apparent genomic distance observed using the GP method, which has a detection limit ≥ 10^-4/base, is within a reasonable range. In other words, the accumulated point mutations are as a consequence of the large generational difference between cells. Obviously, this conclusion needs to be confirmed by other approaches. Our finding regarding this unexpectedly wide genome-to-genome distance will surely collect the interest in this theme which have been less payed with attention.

Except for cancer cells, cells within an individual organism have been previously believed to possess identical genomes. Two brief reports have recently appeared suggesting that cells from a single individual might have different genomes [3, 16], although no hard evidence exists nor has systematic research been performed to confirm those observations. Nevertheless, these reports are consistent with the findings of our study.

Conclusions

The study reported here provided with the first systematic analysis of genome sequence differences among cells in single individuals using the GP method. As a result leaf genome sequences within individual trees were found not to be identical, but varied systematically from the bottom to the top of the tree. Since this phenomenon was detected by the GP method that cannot detect the mutation of less than 10^-5/base/replication, a large number of accumulated mutations must exist between distantly located cells in the tree.

This fact leads to a natural inference that two cells in an individual differ in their genome sequences in relation to their physical distances. In other words, no two cells have completely identical genome sequences. This finding and inference will surely have an influence on the interpretation of various phenomena including mutagens, cancer and others.

Methods

Leaves of Japanese beech (or Buna) (Fagus crenata) trees growing in Sapporo, Japan, and Yoshino cherry (Sakura) (Prunus × yedoensis) trees from Saitama, Japan, were used in this study. The notation A1-2 denotes leaf 2 on branch 1 of tree A. Branch numbers were assigned in the order in which they appeared, beginning from lower (ground) to upper (tree top) levels.

Genomic DNA preparation

After washing leaf samples in 10% sodium dodecyl sulfate (SDS), DNA was extracted using the cetyltrimethylammonium bromide (CTAB) method [40]. Briefly, 100–120-mg samples (wet weight) were homogenized with a mortar and pestle using liquid nitrogen. One milliliter of CTAB solution (200 mM Tris–HCl [pH 9.0], 2% [w/v] CTAB, 2% [w/v] polyvinylpyrrolidone, 0.1% [v/v] 2-mercaptoethanol, 1.4 M NaCl, and 20 mM ethylenediaminetetraacetic acid) was immediately added to the crushed cells, followed by incubation for 1 h at 65°C. After incubation, a 24:1 chloroform-isoamyl alcohol mixture was added; the solution was mixed gently and then centrifuged for 10 min at 12,000 × g (14,000 rpm). This step was repeated twice. An equal volume of propanol was then added to the supernatant, which was centrifuged for 5 min at 16,000 × g (15,000 rpm). In most cases, the pellet obtained was washed with 70% ethanol, centrifuged, and desiccated using an evaporator. Finally, 100 μl of phosphate-buffered saline was added to the precipitate to dissolve the pellet.

GP technology is sufficiently robust such that slight impurities of denatured proteins or polysaccharides will not interfere. Other plant cell components, such as alkaloids and secondary products, can be inhibitory to the PCR reaction, however; consequently, DNA samples were diluted prior to amplification.

Genome profiling (GP)

Genome profiling (GP) uses a set of DNA fragments sampled from genomic DNAs, and is composed of three fundamental steps: random PCR, micro-temperature gradient gel electrophoresis (μTGGE), and data normalization by computer processing [22, 32] (Figure 2). Random PCR can employ arbitrary primers for the PCR reaction because of the relaxed nature of primer binding to template DNA under sufficiently low temperatures. This attribute allows samples of unknown genomic sequence, for which specific primers cannot be designed, to be amplified. As a consequence, DNA fragments from any genomic DNA can be collected independently of the sequence of an oligonucleotide primer used [30, 31] (Note that a single primer is used for random PCR).

Random PCR

Random PCR was performed using primers HUNT (5′-dTGCTGCTGCTGC-3′) and Pfm12 (5′-dAGAACGCGCCTG-3′), which were Cy3-labeled at their 5′ ends. The reaction mixture (25 μl total volume) for random PCR contained 1 ng template DNA, 100 μM primer DNA, 200 μM dNTPs, 10 mM Tris–HCl (pH 9.0), 50 mM KCl, 2.5 mM MgCl₂, and 0.02 unit μl^-1Taq DNA polymerase (Takara Bio Inc., Shiga, Japan). During random PCR, contamination by other organisms should be carefully avoided. To inactivate any contaminating DNAs that could act as a template, the entire random PCR solution, without the template DNA, was therefore UV-irradiated prior to the reaction. Random PCR was carried out using 30 cycles of denaturation (94°C, 30 s), annealing (26°C, 1 min), and extension (47°C, 1 min) on a C1000 thermal cycler (Bio-Rad, Hercules, CA, USA). The second random PCR mixture (50 μl volume) contained 1 μl of the first PCR product as template and the same concentrations of constituents used in the original reaction. The reaction was performed using 10 cycles of denaturation (94°C, 30 s), annealing (60°C, 1 min), and extension (74°C, 1 min). Only 10 cycles were used to ensure that the reaction was terminated before all primer molecules were consumed; this was necessary to guarantee that the major PCR products were in a double-stranded state and thus suitable for TGGE analysis (i.e., so that the melting transition of double-stranded DNA to a single-stranded form can be detected).

μTGGE analysis

For μTGGE, we used a tiny slab gel (24 × 16 × 1 mm³) set on a μ-TG temperature-gradient generator (Taitec, Iruma, Japan) for electrophoresis [32]. Two internal reference DNAs with known melting patterns were co-migrated during each electrophoretic run to calibrate each genome profile, giving highly reproducible results [41]: a 200-bp Ref1 (a 191-bp fragment from the bacteriophage fd gene VIII, sites 1350–1540, attached to a 9-bp sequence, CTACGTCTC, at the 3′ end; T_m = 60°C) and a 900-bp Ref2 taken from a 4361-bp pBR322 fragment (T_m = 61.4°C). Fluorescently-labeled primers MA1 (5′-cy3-dTGCTACGTCTCTTCCGATGCTGTCTTTCGCT-3′) and MA2 (5′-dCCTTGAATTCTATCGGTTTATCA-3′), Ref6F (5′-cy3-dGCCGGCATCACCGGCGCCACAGGTGCGGTTG-3′), and Ref6R (5′-dTAGCGAGGTGCCGCCGGCTTCCATTCAGGTC-3′) were used to generate internal references 1 and 2, respectively. The gel used was 6% polyacrylamide (19:1 acrylamide:bis) containing 500 mM Tris–HCl, 485 mM boric acid, 20 mM EDTA (pH 8.0), and 8 M urea. Approximately 2 μg of DNA was loaded onto the gel and subjected to electrophoresis with a linear temperature gradient of 15 to 65°C for 12 minutes at 100 V cm^-1. After electrophoresis, DNA bands were detected using an FX Molecular Imager fluorescence imager (Bio-Rad).

Computer-aided data analysis

Genome profiles obtained by the GP method are highly informative, but difficult to interpret because of their complexity. To overcome this problem, feature points called spiddos can be introduced [22]. Spiddos correspond to points where DNA structural transitions occur, such as from double-stranded to single-stranded DNA [42]. The coordinates of spiddos are established to be reproducibly obtained by an internal reference-mediated normalization (i.e., the coordinates of the two reference points contained in each GP profile (ref 1 and ref 2, Figure 2C) are used to calibrate the coordinates of the featuring points for same DNAs) which is sequence- and size-dependent.

Using these normalized coordinates, a pattern similarity score (PaSS) between two genomes can be measured as follows:

PaSS = 1 - \frac{1}{n} \sum_{i = 1}^{n} \frac{|{\overset{⇀}{p}}_{i} - {\overset{⇀}{p}}_{i}^{'}|}{|{\overset{⇀}{p}}_{i}| + |{\overset{⇀}{p}}_{i}^{'}|},

(6)

where ${\overset{⇀}{p}}_{i}$ and ${\overset{⇀}{p}}_{i}^{'}$ correspond to the normalized positional vectors (composed of two elements: mobility μ and temperature θ) for spiddos ${\overset{⇀}{p}}_{i}$ and ${\overset{⇀}{p}}_{i}^{'}$ collected from two genome profiles, respectively, and i denotes the spiddo serial number. In general, 0 ≤ PaSS ≤ 1, and thus, 0 ≤ d_G ≤ 1. PaSS is equal to one when two spiddo sets match perfectly.

Genomic distance (d_G), a more practical form, is derived from PaSS as follows:

d_{G} = 1 - PaSS

(7)

If d_G is sufficiently small (<< 1), the two genomes of interest belong to the same species.

Cluster analysis of GP data

To cluster species based on calculated d_G values, we used Ward’s clustering method as implemented in the software program FreeLighter [25, 43, 44].

Sequencing

DNA bands of interest were extracted from TGGE microgels and used as PCR templates in reaction mixtures containing 320 μM dNTPs, 100 μM primer pfm12 (5′-dAGAACGCGCCTG-3′), 10 mM Tris–HCl (pH 9.0), 50 mM KCl, 2.5 mM MgCl₂, and 0.02 unit μl^-1Taq DNA polymerase (Takara). Reaction conditions consisted of 30 cycles of denaturation at 94°C for 30 s, annealing at 60°C for 60 s, and extension at 74°C for 60 s. The resulting random PCR products (DNA) were ligated to pGEM-T Easy vectors (Promega, Madison, WI, USA) at 4°C overnight. Competent cells of E. coli DH5α (Toyobo Co. Ltd., Osaka, Japan) were transformed with the ligation product. Transformed cells were cultivated on LB agar plates (1% tryptone, 0.5% yeast extract, 1% NaCl [pH 7.0], and 1.5% agar) supplemented with ampicillin (10 mg in 200 ml of LB media), 20 μl X-Gal (50 mg ml^-1 in dimethyformamide), and 100 μl of 0.1 M IPTG (isopropylthio-β-galactoside). The agar plates were incubated at 37°C for 12–14 h. White colonies on the plates were selected with a sterile toothpick, transferred to LB broth (1% tryptone, 0.5% yeast extract, and 1% NaCl, pH 7.0; 10 mg ampicillin), and incubated at 37°C for 12–14 h with shaking at about 180 rpm. After confirmation of gene insertion, plasmid DNA was purified using a Wizard Plus SV Minipreps DNA purification system (Promega) and commercially sequenced (Operon Bio-technology, Tokyo, Japan).

Availability of supporting data

Data sets supporting the results of this study are included within the article and its additional files.

Abbreviations

GP:: Genome profiling
NGS:: Next-generation sequencing
μTGGE:: Micro-temperature gradient gel electrophoresis
Spiddos:: Species identification dots
PaSS:: Pattern similarity score
dG:: Genomic distance.

References

Varki A, Nelson DL: Genomic Comparisons of Humans and Chimpanzees. Annu Rev Anthropol. 2007, 36: 191-209. 10.1146/annurev.anthro.36.081406.094339.
Article Google Scholar
Wheeler DA, Srinivasan M, Egholm M, Shen Y, Chen L, McGuire A, He W, Chen Y, Makhijani V, Roth GT, Gomes X, Tartaro K, Niazi F, Turcotte CL, Irzyk GP, Lupski JR, Chinault C, Song XZ, Liu Y, Yuan Y, Nazareth L, Qin X, Muzny DM, Margulies M, Weinstock GM, Gibbs RA, Rothberg JM: The complete genome of an individual by massively parallel DNA sequencing. Nature. 2008, 452: 872-876. 10.1038/nature06884.
Article CAS PubMed Google Scholar
Yong E: Tree’s leaves genetically different from its roots. Nat news. 2012, doi:10.1038/nature.2012.11156
Google Scholar
Kidd JM, Gravel S, Byrnes J, Moreno-Estrada A, Musharoff S, Bryc K, Degenhardt JD, Brisbin A, Sheth V, Chen R, McLaughlin SF, Peckham HE, Omberg L, Chung CAB, Stanley S, Pearlstein K, Levandowsky E, Acevedo-Acevedo S, Auton A, Keinan A, Acuña-Alonzo V, Barquera-Lozano R, Canizales-Quinteros S, Eng C, Burchard EG, Russell A, Reynolds A, Clark AG, Reese MG, Lincoln SE, et al: Population Genetic Inference from Personal Genome Data: Impact of Ancestry and Admixture on Human Genomic Variation. Am J Hum Genet. 2012, 91: 660-671. 10.1016/j.ajhg.2012.08.025.
Article CAS PubMed Central PubMed Google Scholar
Meyer M, Kircher M, Gansauge MT, Li H, Racimo F, Mallick S, Schraiber JG, Jay F, Prüfer K, de Filippo C, Sudmant PH, Alkan C, Fu Q, Do R, Rohland N, Tandon A, Siebauer M, Green RE, Bryc K, Briggs AW, Stenzel U, Dabney J, Shendure J, Kitzman J, Hammer MF, Shunkov MV, Derevianko AP, Patterson N, Andrés AM, Eichler EE, et al: A High-Coverage Genome Sequence from an Archaic Denisovan Individual. Science. 2012, 338: 222-226. 10.1126/science.1224344.
Article CAS PubMed Central PubMed Google Scholar
Guttman M, Amit I, Garber M, French C, Lin MF, Feldser D, Huarte M, Zuk O, Carey BW, Cassady JP, Cabili MN, Jaenisch R, Mikkelsen TS, Jacks T, Hacohen N, Bernstein BE, Kellis M, Regev A, Rinn JL, Lander ES: Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals. Nature. 2009, 458: 223-227. 10.1038/nature07672.
Article CAS PubMed Central PubMed Google Scholar
Jurka J, Kapitonov VV, Kohany O, Jurka MV: Repetitive Sequences in complex genomes: Structure and Evolution. Annu Rev Genomics Hum Genet. 2007, 8: 241-259. 10.1146/annurev.genom.8.080706.092416.
Article CAS PubMed Google Scholar
Saito A, Nishigaki K: Homogenization of Chromosomes Revealed by Oligonucleotide-Stickiness. J Comput Chem Jpn. 2004, 3: 145-152. 10.2477/jccj.3.145.
Article CAS Google Scholar
McVean GA, Myers SR, Hunt S, Deloukas P, Bentley DR, Donnelly P: The Fine-Scale Structure of Recombination Rate Variation in the Human Genome. Science. 2004, 304: 581-584. 10.1126/science.1092500.
Article CAS PubMed Google Scholar
Navin N, Kendall J, Troge J, Andrews P, Rodgers L, McIndoo J, Cook K, Stepansky A, Levy D, Esposito D, Muthuswamy L, Krasnitz A, McCombie WR, Hicks J, Wigler M: Tumor evolution inferred by single-cell sequencing. Nature. 2011, 472: 90-94. 10.1038/nature09807.
Article CAS PubMed Central PubMed Google Scholar
Li GW, Xie XS: Central dogma at the single-molecule level in living cells. Nature. 2011, 475: 308-315. 10.1038/nature10315.
Article CAS PubMed Central PubMed Google Scholar
Blanpain C: Tracing the cellular origin of cancer. Nat Cell Biol. 2013, 15: 126-134. 10.1038/ncb2657.
Article CAS PubMed Google Scholar
Piotrowski A, Bruder CE, Andersson R, Diaz de Ståhl T, Menzel U, Sandgren J, Poplawski A, von Tell D, Crasto C, Bogdan A, Bartoszewski R, Bebok Z, Krzyzanowski M, Jankowski Z, Partridge EC, Komorowski J, Dumanski JP: Somatic mosaicism for copy number variation in differentiated human tissues. Hum Mutat. 2008, 29: 1118-1124. 10.1002/humu.20815.
Article PubMed Google Scholar
Liang Q, Conte N, Skarnes WC, Bradley A: Extensive genomic copy number variation in embryonic stem cells. Proc Natl Acad Sci U S A. 2008, 105: 17453-17456. 10.1073/pnas.0805638105.
Article CAS PubMed Central PubMed Google Scholar
O'Huallachain M, Karczewski KJ, Weissman SM, Urban AE, Snyder MP: Extensive genetic variation in somatic human tissues. Proc Natl Acad Sci U S A. 2012, 109: 18018-18023. 10.1073/pnas.1213736109.
Article PubMed Central PubMed Google Scholar
Grant B: DNA may differ between tissues. The Scientist. 2009,http://www.the-scientist.com/?articles.view/articleNo/27529/title/DNA-may-differ-between-tissues,
Google Scholar
Gill DE, Chao L, Perkins SL, Wolf JB: Genetic mosaicism in plants and clonal animals. Annu Rev Ecol Syst. 1995, 26: 423-444. 10.1146/annurev.ecolsys.26.1.423.
Article Google Scholar
Eckert KA, Kunkel TA: DNA polymerase Fidelity and the Polymerase Chain Reaction. Genome Res. 1991, 1: 17-24. 10.1101/gr.1.1.17.
Article CAS Google Scholar
Schmitt MW, Kennedy SR, Salk JJ, Fox EJ, Hiatt JB, Loeb LA: Detection of ultra-rare mutations by next-generation sequencing. Proc Natl Acad Sci U S A. 2012, 109: 14508-14513. 10.1073/pnas.1208715109.
Article CAS PubMed Central PubMed Google Scholar
Richter BG, Sexton DP: Managing and Analyzing Next-Generation Sequence Data. PLoS Comput Biol. 2009, 5: e1000369-10.1371/journal.pcbi.1000369.
Article PubMed Central PubMed Google Scholar
Nishigaki K, Naimuddin M, Hamano K: Genome Profiling: a Realistic Solution for Genotype-Based Identification of Species¹. J Biochem. 2000, 128: 107-112. 10.1093/oxfordjournals.jbchem.a022719.
Article CAS PubMed Google Scholar
Naimuddin M, Kurazono T, Zhang Y, Watanabe T, Yamaguchi M, Nishigaki K: Species-identification dots: a potent tool for developing genome microbiology. Gene. 2000, 261: 243-250. 10.1016/S0378-1119(00)00502-3.
Article CAS PubMed Google Scholar
Naimuddin M, Nishigaki K: Genome analysis technologies: Towards species identification by genotype. Brief Funct Genomic Proteomic. 2003, 1: 356-371. 10.1093/bfgp/1.4.356.
Article CAS PubMed Google Scholar
Kouduka M, Matuoka A, Nishigaki K: Acquisition of genome information from single-celled unculturable organisms (radiolaria) by exploiting genome profiling (GP). BMC Genomics. 2006, 7: 135-10.1186/1471-2164-7-135.
Article PubMed Central PubMed Google Scholar
Kouduka M, Sato D, Komori M, Kikuchi M, Miyamoto K, Kosaku A, Naimuddin M, Matsuoka A, Nishigaki K: A Solution for Universal Classification of Species Based on Genomic DNA. Int J Plant Genomics. 2007, 2007: 27894-
Article PubMed Central Google Scholar
Takasaka T, Sakurada K, Akutsu T, Nishigaki K, Ikegaya H: Trials of the detection of semen and vaginal fluid RNA using the genome profiling method. Leg Med (Tokyo). 2011, 13: 265-267. 10.1016/j.legalmed.2011.05.001.
Article CAS Google Scholar
Ahmed S, Komori M, Tsuji-Ueno S, Suzuki M, Kosaku A, Miyamoto K, Nishigaki K: Genome Profiling (GP) Method Based Classification of Insects: Congruence with That of Classical Phenotype-Based One. PLoS ONE. 2011, 6: e23963-10.1371/journal.pone.0023963.
Article CAS PubMed Central PubMed Google Scholar
Hamano K, Ueno-Tsuji S, Tanaka R, Suzuki M, Nishimura K, Nishigaki K: Genome profiling (GP) as an effective tool for monitoring culture collections: A case study with Trichosporon. J Microbiol Methods. 2012, 89: 119-128. 10.1016/j.mimet.2012.02.007.
Article CAS PubMed Google Scholar
Sakuma Y, Nishigaki K: Computer Prediction of General PCR Products Based on Dynamical Solution Structures of DNA¹. J Biochem. 1994, 116: 736-741.
CAS PubMed Google Scholar
Nishigaki K, Miura T, Tsubota M, Sutoh A, Amano N, Husimi Y: Structural analysis of nucleic acids by precise denaturing gradient gel electrophoresis: II. Applications to the analysis of subtle and drastic mobility changes of oligo- and polynucleotides¹. J Biochem. 1992, 111: 151-156.
CAS PubMed Google Scholar
Nishigaki K, Saito A, Takashi H, Naimuddin M: Whole genome sequence-enabled prediction of sequences performed for random PCR products of Escherichia coli. Nucleic Acids Res. 2000, 28: 1879-1884. 10.1093/nar/28.9.1879.
Article CAS PubMed Central PubMed Google Scholar
Biyani M, Nishigaki K: Hundredfold productivity of genome analysis by introduction of microtemperature-gradient gel electrophoresis. Electrophoresis. 2001, 22: 23-38. 10.1002/1522-2683(200101)22:1<23::AID-ELPS23>3.0.CO;2-Z.
Article CAS PubMed Google Scholar
Watanabe T, Saito A, Takeuchi Y, Naimuddin M, Nishigaki K: A database for the provisional identification of species using only genotypes: web-based genome profiling. Genome Biol. 2002, 3: research0010.1-research0010.8.
Article Google Scholar
Futakami M, Salimullah M, Miura T, Tokita S, Nishigaki K: Novel Mutation Assay with High Sensitivity based on Direct Measurement of Genomic DNA Alterations: Comparable Results to the Ames Test. J Biochem. 2007, 141: 675-686. 10.1093/jb/mvm074.
Article CAS PubMed Google Scholar
Futakami M, Nishigaki K: Measurement of DNA Mutations Caused by Seconds-period UV-irradiation. Chem Lett. 2007, 36: 358-359. 10.1246/cl.2007.358.
Article CAS Google Scholar
Oda H, Hatakeyama Y, Iwano H: Phylogenetic relationships among Bacillus thuringiensis(Bacillaceae: Bacillales) strains based on a comparison of SSU rRNA sequences and genome profiling. Appl Entomol Zool. 2011, 46: 489-496. 10.1007/s13355-011-0068-4.
Article Google Scholar
Flusberg BA, Webster DR, Lee JH, Travers KJ, Olivares EC, Clark TA, Korlach J, Turner SW: Direct detection of DNA methylation during single-molecule, real-time sequencing. Nat Methods. 2010, 7: 461-465. 10.1038/nmeth.1459.
Article CAS PubMed Central PubMed Google Scholar
Proffitt JH, Davie JR, Swinton D, Hattman S: 5-Methylcytosine Is Not Detectable in Saccharomyces cerevisiae DNA. Mol Cell Biol. 1984, 4: 985-988.
Article CAS PubMed Central PubMed Google Scholar
Klekowski EJ, Godfrey PJ: Ageing and mutation in plants. Nature. 1989, 340: 389-391. 10.1038/340389a0.
Article Google Scholar
Ibrahim RIH: A modified CTAB protocol for DNA extraction from young flower petals of some medicinal plant species. Gene Conserve. 2011, 10: 165-182.
Google Scholar
Nishgaki K, Tsubota M, Miura T, Chonan Y, Husimi Y: Structural analysis of nucleic acids by precise denaturing gradient gel electrophoresis: I. Methodology. J Biochem. 1992, 111: 144-150.
Google Scholar
Lilley D, Dahlberg J: DNA Structures Part B: Chemical and Electrophoretic Analysis of DNA. Methods in Enzymology. Edited by: Abrams ES, Stanton VPJr. 1992, 71-104. Volume 212
Google Scholar
Ward JH: Hierarchical grouping to optimize an objective function. J Am Stat Assoc. 1963, 58: 236-244. 10.1080/01621459.1963.10500845.
Article Google Scholar
Ahmed S, Nishigaki K: Error-Robust Nature of Genome Profiling Applied for Clustering of Species Demonstrated by Computer Simulation. Int J Biol Life Sci. 2007, 3: 82-88.
Google Scholar

Download references

Acknowledgements

We are grateful to Dr. Hisao Honda (Hyogo University) for his scientific suggestions and encouragement. We thank Dr. Robert L. Fischer (University of California) for his valuable advice. This study was partly supported by a grant from the City Area Program (Saitama Metropolitan Area) from the Ministry of Education, Culture, Sports, and Technology (MEXT).

Author information

Authors and Affiliations

Graduate School of Science and Engineering, Department of Functional Materials Science, Saitama University, Saitama, 338-8570, Japan
Deepti Diwan, Shun Komazaki, Miho Suzuki, Naoto Nemoto & Koichi Nishigaki
Department of Science, Hokkaido University, Sapporo, Japan
Akiko Satake
Graduate School of Information Science and Technology, Osaka University, Suita, Japan
Takuyo Aita

Authors

Deepti Diwan
View author publications
You can also search for this author in PubMed Google Scholar
Shun Komazaki
View author publications
You can also search for this author in PubMed Google Scholar
Miho Suzuki
View author publications
You can also search for this author in PubMed Google Scholar
Naoto Nemoto
View author publications
You can also search for this author in PubMed Google Scholar
Takuyo Aita
View author publications
You can also search for this author in PubMed Google Scholar
Akiko Satake
View author publications
You can also search for this author in PubMed Google Scholar
Koichi Nishigaki
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Koichi Nishigaki.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

SK carried out initial experiments to establish the overall experimental protocol, and collected and processed GP data. DD performed GP analyses and supplementary experiments, including methylation experiments, finalized the study, and wrote the manuscript. MS, NN, and TA analyzed the data; AS helped analyze the data and collected samples. KN designed and directed the study, analyzed data, and helped write and edit the manuscript. All authors read and approved the final manuscript.

Deepti Diwan, Shun Komazaki contributed equally to this work.

Electronic supplementary material

12864_2013_5773_MOESM1_ESM.tiff

Additional file 1: Figure S1: Sequence-based clustering of leaves from (A) Yoshino cherry and (B) Japanese beech trees. Only sequence data that could be consistently assigned were used. Clustering was performed using Consensus Maker v2.0.0 (http://www.hiv.lanl.gov/content/sequence/CONSENSUS/consensus.html). Yoshino cherry tree leaf number designations are arbitrary. (TIFF 105 KB)

12864_2013_5773_MOESM2_ESM.docx

Additional file 2:DNA consensus sequence data of leaves used for analysis were derived using Consensus Maker v2.0.0 (http://www.hiv.lanl.gov/content/sequence/CONSENSUS/consensus.html), and then used to construct clustering tree. (These sequences are deposited in GenBank database: KJ411230-KJ411277). (DOCX 24 KB)

12864_2013_5773_MOESM3_ESM.xlsx

Additional file 3: Table S1: Genome distances (d_G) among genomes of Japanese beech tree leaves. Each value is the average of two independent experiments. (XLSX 15 KB)

12864_2013_5773_MOESM4_ESM.xlsx

Additional file 4: Table S2: Genome distances (d_G) among genomes of Yoshino Cherry tree leaves. Each value is the average of three independent experiments. (XLSX 12 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver ( https://creativecommons.org/publicdomain/zero/1.0/ ) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Diwan, D., Komazaki, S., Suzuki, M. et al. Systematic genome sequence differences among leaf cells within individual trees. BMC Genomics 15, 142 (2014). https://doi.org/10.1186/1471-2164-15-142

Download citation

Received: 22 September 2013
Accepted: 10 February 2014
Published: 19 February 2014
DOI: https://doi.org/10.1186/1471-2164-15-142

Systematic genome sequence differences among leaf cells within individual trees

Abstract

Background

Results

Conclusions

Background

Results and discussion

Conclusions

Methods

Genomic DNA preparation

Genome profiling (GP)

Random PCR

μTGGE analysis

Computer-aided data analysis

Cluster analysis of GP data

Sequencing

Availability of supporting data

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Electronic supplementary material

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Genomics

Contact us