A kinetic-based sigmoidal model for the polymerase chain reaction and its application to high-capacity absolute quantitative real-time PCR

Rutledge, Robert G; Stewart, Don

doi:10.1186/1472-6750-8-47

Methodology article
Open access
Published: 08 May 2008

A kinetic-based sigmoidal model for the polymerase chain reaction and its application to high-capacity absolute quantitative real-time PCR

Robert G Rutledge¹ &
Don Stewart¹

BMC Biotechnology volume 8, Article number: 47 (2008) Cite this article

22k Accesses
173 Citations
Metrics details

Abstract

Background

Based upon defining a common reference point, current real-time quantitative PCR technologies compare relative differences in amplification profile position. As such, absolute quantification requires construction of target-specific standard curves that are highly resource intensive and prone to introducing quantitative errors. Sigmoidal modeling using nonlinear regression has previously demonstrated that absolute quantification can be accomplished without standard curves; however, quantitative errors caused by distortions within the plateau phase have impeded effective implementation of this alternative approach.

Results

Recognition that amplification rate is linearly correlated to amplicon quantity led to the derivation of two sigmoid functions that allow target quantification via linear regression analysis. In addition to circumventing quantitative errors produced by plateau distortions, this approach allows the amplification efficiency within individual amplification reactions to be determined. Absolute quantification is accomplished by first converting individual fluorescence readings into target quantity expressed in fluorescence units, followed by conversion into the number of target molecules via optical calibration. Founded upon expressing reaction fluorescence in relation to amplicon DNA mass, a seminal element of this study was to implement optical calibration using lambda gDNA as a universal quantitative standard. Not only does this eliminate the need to prepare target-specific quantitative standards, it relegates establishment of quantitative scale to a single, highly defined entity. The quantitative competency of this approach was assessed by exploiting "limiting dilution assay" for absolute quantification, which provided an independent gold standard from which to verify quantitative accuracy. This yielded substantive corroborating evidence that absolute accuracies of ± 25% can be routinely achieved. Comparison with the LinReg and Miner automated qPCR data processing packages further demonstrated the superior performance of this kinetic-based methodology.

Conclusion

Called "linear regression of efficiency" or LRE, this novel kinetic approach confers the ability to conduct high-capacity absolute quantification with unprecedented quality control capabilities. The computational simplicity and recursive nature of LRE quantification also makes it amenable to software implementation, as demonstrated by a prototypic Java program that automates data analysis. This in turn introduces the prospect of conducting absolute quantification with little additional effort beyond that required for the preparation of the amplification reactions.

Background

Of the many attributes of real-time quantitative PCR (qPCR), the ability to conduct absolute quantification is arguably the most significant. From a technical perspective, absolute quantification allows assay performance to be precisely defined, from which sensitivity, effective quantitative range and quantitative accuracy can be expressed in absolute terms. From an application perspective, assessing biological significance within the context of absolute number of target molecules can enhance the utility of most, if not all, quantitative assays. Many prominent examples come from biomedical diagnostics where absolute quantification can have direct clinical relevancy, as is evident for monitoring viral load and residual disease. Although more general applications such as environmental screening and pathogen detection would also benefit greatly, it is the application of absolute qPCR to gene expression profiling that holds some of the most substantive implications.

Historically, real-time qPCR has largely been relegated to a supportive role in large-scale gene expression studies, most frequently used for the verification of DNA microarray datasets [1–3]. Nevertheless, absolute qPCR has the potential to extend gene expression analysis beyond what is possible with microarray analysis, based upon the innate capability to overcome two of the greatest limitations of microarray quantification, which are limited sensitivity and lack of absolute scale [4]. Some of the most illustrative examples come from the application of microarrays to clinical research and diagnostics. Founded upon the expectation that gene expression analysis can be used as a diagnostic tool to both predict and follow therapeutic outcomes [4, 5], many studies have reported that effective diagnoses can be achieved with relatively small groups of biomarker transcripts, numbering between 10 and 100 [6, 7], a range that is potentially within the capacity of absolute qPCR technologies.

Development of diagnostic assays for disease prediction could thus exploit the reduced technical complexity, speed of analysis, sensitivity and substantively greater resolution provided by real-time qPCR, as compared with microarray analysis [8]. Indeed, absolute quantification could increase the efficacy of any gene expression profiling initiative, irrespective of the experimental context. Nevertheless, a prominent inadequacy of current real-time qPCR technologies is the limited capacity for conducting absolute quantification, due to reliance on target-specific standard curves [9]. Not only does this necessitate preparation of a quantified standard for each target under investigation, the technical difficulties and extensive resources required for standard curve construction present considerable challenges for conducting absolute quantification, even for a modest number of targets.

A number of studies have attempted to overcome the innate limitations of standard curves by analyzing the fluorescence readings generated by individual amplification reactions. These include determining amplification efficiency through the application of exponential mathematics to the log-linear region, using either linear regression [10–13] or nonlinear regression [14, 15]. Attempts to model the entire amplification profile have included sigmoidal modeling using nonlinear regression [16–20], in addition to the application of various biochemical-based models [21–23], and other forms of mathematical modeling [24–26]. Demonstration that absolute quantification can be achieved by combining optical calibration with sigmoidal modeling (a method called "sigmoidal curve fitting" or SCF [18]) has led several groups to evaluate this approach as an alternative to standard curve-based quantification [27–33]. Unfortunately, effective implementation of SCF has been impeded by errors produced by distortions within the plateau phase, which can severely compromise the accuracy of SCF-based quantification [18, 31–33].

The study presented here extends SCF quantification by adapting the sigmoid function upon which SCF is based, to directly model PCR amplification without the need to conduct nonlinear regression. Based upon a linear relationship between amplification rate and amplicon quantity, this allows target quantification to be conducted using linear regression analysis. In addition to eliminating errors produced by plateau phase anomalies, this provided the foundation for development of a new quantitative paradigm that does not require standard curves and is able to resolve quantitative differences on the order of 0.5 fold, while providing unprecedented quality control capabilities. A prototypic Java program which automates implementation of this kinetic-based methodology further illustrates the potential to develop high-throughput applications, in addition to providing a visual illustration of the underlying principles.

Results

Amplification efficiency is dynamic and is coupled to amplicon DNA quantity

New insights into the dynamics of PCR amplification have been gained through the application of sigmoidal modeling [11, 18], in which nonlinear regression analysis is used to fit real-time SYBR Green I fluorescence readings to the sigmoid function:

F_{C} = \frac{F_{\max}}{1 + e^{- (\frac{C - C_{1 / 2}}{k})}} + F_{b}

(1)

where F_Cis the reaction fluorescence at cycle C and is proportional to the mass of amplicon DNA present in the reaction, F_maxis the maximal reaction fluorescence that defines the end point of the amplification process, referred to as the plateau phase, C_1/2 is the fractional cycle at which reaction fluorescence reaches half of F_max, k is related to the slope of the curve, and F_bis the fluorescence background. Despite its apparent novelty, this equation is in fact a simple derivative of the classic Boltzmann four-parametric sigmoid function that is commonly used to model sigmoidal datasets.

Notwithstanding the complexities of conducting nonlinear regression analysis, the remarkable precision that can be achieved is indicative of the potential for sigmoidal modeling to fundamentally revolutionize real-time qPCR [18]. An essential insight into this potential comes from examination of PCR amplification kinetics, as described by a second sigmoid function [11]:

E_{C} = \frac{1 + e^{- (\frac{C - 1 - C_{1 / 2}}{k})}}{1 + e^{- (\frac{C - C_{1 / 2}}{k})}} - 1

(2)

where E_Cis the amplification efficiency at cycle C, also referred to as "cycle efficiency" [18]. Under a sigmoidal model, amplification rate is maximal at the onset of thermocycling, but progressively decreases such that each cycle has a unique amplification efficiency, with entry into the plateau phase occurring as amplification efficiency approaches zero.

The principle of this and other insights can be illustrated by comparing plots generated with equations 1 and 2, the most notable being the striking symmetry between amplicon DNA accumulation and reduction in cycle efficiency (Figure 1A). Importantly, this implies that an association exists between amplicon DNA quantity and amplification efficiency, a contention supported by the mathematical prediction of a linear relationship between reaction fluorescence and cycle efficiency (Figure 1B).

Recognition of this linear relationship not only impacts the practicalities of modeling PCR amplification, but also has unparalleled implications for how real-time qPCR can be implemented. Central to this is the prediction that the dynamics of PCR amplification can be described by a linear equation, defined here as:

E_c= ΔE × F_c+ E_max (3)

where the slope defines the rate of loss in cycle efficiency (ΔE) and the intercept defines the maximal amplification efficiency (E_max) when F_C= 0 (Figure 1B). It is also evident that as PCR amplification enters the plateau phase, reaction fluorescence approaches a maximum (F_max) as E_Capproaches zero, such that equation (3) becomes:

0 = ΔE × F_max + E_max

so that:

F_{\max} = \frac{E_{\max}}{- Δ E}

(4)

In addition to greatly simplifying the mathematics describing amplification kinetics, of practical significance is the ability to obtain estimates of E_maxand ΔE via linear regression analysis, utilizing the fluorescence readings produced by individual PCR reactions. Termed "linear regression of efficiency" or LRE, this approach not only generates a linear representation of PCR amplification, but as described in later sections, allows target quantity to be determined directly from individual fluorescence readings. An important qualification, however, is the extent to which experimental data comply with these mathematical predictions.

Conformity of real-time amplification profiles generated with SYBR Green I

A fundamental approach to analyzing the kinetics of PCR amplification is based upon defining amplification efficiency as the relative increase in amplicon DNA over a single cycle:

E_{C} = \frac{F_{C}}{F_{C - 1}} - 1

(5)

where F_C-1is the reaction fluorescence of the preceding cycle. This provides an estimate of cycle efficiency from which kinetic analysis can be conducted without having to resort to nonlinear regression analysis. It should be noted that Alvarez et al. (2007) [19] have utilized a similar approach for determining E_max(which they refer to as the "intrinsic" amplification efficiency), although they concluded that a two-parametric sigmoid function was superior to the linear model upon which LRE is based. Nevertheless, the validity of equation 5 is dependent on the assumption that reaction fluorescence remains proportional to amplicon DNA quantity throughout the amplification process. While this may be most evident for fluorescent dyes such as SYBR Green I, other detection chemistries may not conform well to this assumption (e.g. see Swillens et al. (2004) [34]). In fact SYBR Green I detection may not be free of anomalies, particularly at high amplicon DNA quantities. This presumption is based upon the low quantities of SYBR Green I present in commercial real-time PCR formulations (estimated to be 0.1–0.2X), made necessary by the inhibitory nature of SYBR Green I. This in turn could distort the apparent cycle efficiency as predicted by equation 5, potentially invalidating, or at least distorting, LRE-based estimates of E_maxand ΔE.

A major goal of this study was therefore to evaluate the efficacy of sigmoidal modeling for conducting absolute quantification, with the explicit objective of validating the quantitative competency of a LRE-based methodology; that is, beyond simply generating target quantities that lack context. Central to this initiative was the utilization of bacteriophage lambda genomic DNA (lambda gDNA) as a highly defined, commercially available quantitative standard. Not only does this approach make much of this work amenable to experimental replication, it serves to illustrate some of the exceptional attributes of employing lambda gDNA as a universal quantitative standard. Paramount is the ability to establish quantitative scale, which is essential to conducting absolute quantification. As will be illustrated in later sections, this is also requisite to eliminating the need for standard curves.

Notwithstanding the potential utility of LRE-based quantification, an essential starting point was to examine the potential distortion of reaction fluorescence generated by low quantities of SYBR Green I. It was initially surmised that increasing SYBR Green I quantity would be informative; however, the sensitivity of QuantiTect (a T. aquaticus DNA polymerase (Taq) based formulation) to SYBR Green I inhibition precluded the ability to apply anything but a moderate increase in SYBR Green I quantity. Subsequently, two alternative non-Taq formulations were evaluated, based on the speculation that they could be more resilient to higher quantities of SYBR Green I. These were DyNAmo, formulated with an engineered T. brockianus DNA polymerase fused to a non-specific DNA-binding region, and FullVelocity, formulated with an unspecified archaeal DNA polymerase.

As summarized in Figure 2, three series of replicate amplification reactions supplemented with progressively greater quantities of SYBR Green I demonstrate, based on similarity in amplification profile position, that addition of 1-2X SYBR Green I was marginally inhibitory to both DyNAmo and FullVelocity. In contrast, QuantiTect was greatly impacted by addition of 0.8X SYBR Green I, as reflected by the large shift in position, and extensive scattering, of the replicate amplification profiles (Figure 2A). More significant, however, is the substantive increase in F_max produced by increasing SYBR Green I, establishing that reaction fluorescence intensity is dependent on SYBR Green I quantity. Notwithstanding this dependency, increasing SYBR Green I quantity had no apparent impact on the general sigmoidal shape of the amplification profiles. The amplification profiles produced by DyNAmo and FullVelocity further indicate that the sigmoidal shape of SYBR Green I amplification profiles is not unique to Taq.

The sigmoidal character of these amplification profiles was further substantiated by LRE analysis, which revealed a linear domain corresponding to the central region of each amplification profile (Figure 2B, D and 2F). These plots further reveal that the fluorescence readings generated by later cycles do not conform well to the LRE model, as reflected by the "spilling" of points off of the LRE line. Importantly, this is consistent with what was previously observed during development of the SCF method, where anomalies associated with the plateau phase were common, and significantly distorted nonlinear regression analyses. This subsequently required exclusion of the plateau cycles from the nonlinear regression, based upon a somewhat esoteric method for selecting a "cutoff cycle" [18]. Nevertheless, as is apparent in the LRE plots presented in Figure 2, a kinetic-based approach provides the ability to identify anomalous fluorescence readings based on loss of conformity with the LRE model.

Thus, while testing of the quantitative capabilities of LRE analysis requires additional tools, this initial assessment does provide substantive evidence that SYBR Green I real-time profiles can conform well to that predicted by sigmoidal modeling. Furthermore, these results suggest that despite the apparent complexities of the classic Boltzmann sigmoid function, it should be possible to develop a sigmoidal model for the polymerase chain reaction, derived from the two kinetic parameters predicted by the LRE model to govern PCR amplification.

Derivation of a kinetic-based sigmoid model for the polymerase chain reaction

Conformity of PCR amplification to the classic Boltzmann sigmoid function (equation 1) poses the question as to how C_1/2, k and F_maxrelate to ΔE and E_max. Although equation (4) predicts that F_maxis defined by the ratio of E_maxto ΔE, it is less clear how k and C_1/2 relate. As summarized in additional file 1, C_1/2 and k can be eliminated through a series of rearrangements and substitutions, producing two functions that allow modeling of PCR amplification based solely on ΔE and E_max. On the assumption that the fluorescence background (F_b) is zero, these are:

F_{0} = \frac{F_{\max}}{1 + (\frac{F_{m a x}}{F_{C}} - 1) {(E_{\max} + 1)}^{C}}

(6)

F_{C} = \frac{F_{\max}}{1 + (\frac{F_{m a x}}{F_{0}} - 1) {(E_{\max} + 1)}^{- C}}

(7)

Thus, once values for ΔE and E_maxhave been obtained via LRE analysis, equation 6 can be used to convert individual F_C readings into target quantity expressed in fluorescence units (F₀). Target quantification is then based on averaging the F₀ values derived from the cycles used in LRE analysis, followed by conversion into the number of target molecules via optical calibration. Furthermore, once an average F₀ value has been obtained, the corresponding amplification profile can be modeled by using equation 7 to calculate predicted F_C values for each cycle. As illustrated in the next section, this series of computations is capable of modeling PCR amplification to a very high degree of precision, and without the need to conduct nonlinear regression analysis.

It should noted that Chervoneva et al. (2007) [20] have recently described modeling of real-time PCR using a logistic function identical to equation 7. However, the primary objective of their study was to determine E_maxfor relative quantification, based on nonlinear regression analysis. This contrasts the kinetic approach taken in this study, in which absolute quantification is based on converting fluorescence readings into target quantity using equation 6.

Implementation of LRE quantification

Initial setup

With equations 6 and 7 in hand, the practical and analytical capabilities of LRE quantification can be tested. Furthermore, the computational simplicity of the methodology makes it amenable to manual implementation using a spread sheet. Additional data file 2 contains the MS Excel templates used for the data analysis conducted in this study. The general approach involves a series of steps that culminates in a recursive process in which the conformity of individual fluorescence readings is used to optimize the analysis.

Fluorescence readings are first imported into the spreadsheet and background fluorescence subtracted, as estimated by averaging 6–12 baseline cycles (i.e. before amplicon DNA becomes detectable). Replicate F_C datasets are then averaged (see below), cycle efficiency (E_C) calculated for each cycle using equation 5, and E_C plotted against F_C to generate what is called the "LRE plot" (Figure 2B, D and 2F). A contiguous group of points is then selected for linear regression analysis, from which estimates of ΔE (slope) and E_max(intercept) are obtained, a process called "LRE analysis". Referred to as the "LRE window", this region is elemental to defining the sigmoidal model of an amplification profile. The linear integrity of the LRE window is thus crucial to the quantitative accuracy of the assay.

Various methods for selecting the LRE window size and position have been tested. In practice, details of LRE window selection for many amplicons are somewhat inconsequential, due to the high degree of conformity that can be generated. Nevertheless, as noted earlier, later cycles of some amplification profiles do not conform well to sigmoidal modeling. This is apparent in Figure 2B, D and 2F in which points corresponding to cycles within the upper region of each amplification profile "drift" off the LRE line. A key objective is therefore to avoid inclusion of these non-conforming cycles into the LRE window, while at the same time selecting the largest possible LRE window in order to maximize the precision of the linear regression analysis.

A logical starting point is to initially select a small LRE window positioned within the lower region of an amplification profile, and to progressively add consecutive cycles to the LRE window (i.e. to expand the upper limit of the LRE window), repeating the linear regression analysis as each new cycle is added to the LRE window. This is continued until encountering a cycle that clearly does not conform, based upon divergence from the LRE line. Although this approach can be reasonably effective, a more objective method that has proven to be both simpler and more sensitive becomes evident during assessment of LRE quantitative precision. However, before this can be described, another foundational principle of LRE quantification must first be implemented.

Target quantification, recursive analysis and the precision of LRE modeling

A fundamental attribute of LRE quantification, as exemplified by equation 6, is the ability to convert individual F_C readings into a target quantity, once estimates for ΔE and E_maxhave been obtained. Indeed, several intriguing behaviors become evident when this is applied across an entire amplification profile, generating what is referred to as the "F₀ plot". The most notable is that F₀ values encompassed by the LRE window are very similar, a trend that can extend to F_C readings close to the baseline fluorescence (Figure 3). In addition to providing multiple estimates of target quantity from each amplification profile, this illustrates the extraordinary precision that LRE modeling can achieve. Typically, high-quality fluorescence datasets generate a coefficient of variance (CV) for F₀ of < ± 1.0% over a 4–6 cycle LRE window (Figure 3).

A second attribute of F₀ plots is characterized by a rapid divergence of F₀ values derived from later cycles within many amplification profiles. As would be anticipated, this coincides precisely with the loss in conformity to the LRE line generated in the corresponding LRE plot. Although the examples presented in Figure 3 produced a sweeping upward arc, downward arcing have also been commonly observed. In either case, the abruptness of this divergence provides a dependable marker for loss of conformity, which in turn allows the upper limit of the LRE window to be objectively defined.

Based upon the general principle of conformity, the utility of using recursive analysis to optimize the LRE window size was explored, primarily to develop an algorithm for automating LRE data analysis. The approach starts by defining a small LRE window in the lower region of a profile. Although software implementation can provide several methods for determining the lower boundary of the LRE window, manual implementation was based upon empirical determination of the lowest F_C reading that generates sufficient precision to produce a reliable E_C value for the first cycle of the LRE window (referred to as the LRE window "start cycle"). Linear regression analysis is conducted on this preliminary LRE window, and the F_C readings across the entire profile converted to F₀ (equation 6). An average F₀ value is then calculated from the F_C readings encompassed by the LRE window. It should be noted that this includes the F_C generated in the cycle immediately preceding the start cycle, due to the fact that this F_C reading is the denominator used to calculate E_C for the start cycle.

The recursive nature of this approach derives from comparing the average F₀ produced by the LRE window, to that generated by the cycle immediately following the last cycle of the LRE window. The difference, expressed as a percentage of the average F₀, is then used to assess the level of conformity. A threshold value (7.5% difference in this study) is then used to determine whether this cycle should be subsumed into the LRE window. If so, the LRE window is expanded to include this next cycle, and the LRE analysis is repeated. This recursive process is continued until a cycle is encountered that produces a F₀ difference larger than the threshold. An example of this recursive approach is presented in Figure 4, which provides a more detailed illustration of the process. Worthy of note are the relatively small differences (typically < 0.5 fold) in the average F₀ produced by expanding the LRE window to include cycles that are clearly nonconforming, as is illustrated in Figure 4. Even though a more formal documentation of the robustness of LRE analysis will not be attempted here, the prototypic Java program introduced at the end of the study provides the ability to quickly assess the impact of LRE window size and position.

F₀ plots thus illustrate the two key attributes that underpin the quantitative capability of the LRE method. The first is the ability to objectively define the LRE window based on conformity of the derived F₀ values. The second is estimating target quantity based upon averaging F₀ values encompassed by the LRE window (typically 4–6 cycles). Furthermore, the CV of the LRE window F₀ values is an effective indicator of the general quality of the fluorescence dataset (Figure 3), which is a major determinant of the general efficacy of LRE-based quantification (see below). Another indicator of the precision that can be achieved with LRE modeling is the correlation between actual F_C readings and that predicted by equation 7. The profiles presented in Figure 5 exemplify high-quality datasets, which can generate predicted F_C values that differ on average < 0.5% of the actual F_C readings within the LRE window.

Maximizing optical precision

Notwithstanding the high degree of precision that can be achieved, it became evident during the early stages of LRE implementation that a number of optical factors can compromise, sometimes severely, the quality of a fluorescence dataset. Before addressing further the quantitative capability of LRE analysis, the role of reaction fluorescence determination and assay optics should first be considered. As might be anticipated, optical precision is a central determinant of the quantitative accuracy and reliability that can be achieved with LRE quantification. Two simple steps can be taken to increase optical precision. The first is to take multiple fluorescence readings at the end of each cycle (three in this study) and to use the average for determining reaction fluorescence. This reduces the error-of-measurement produced by the instrument's optical system. The second is to conduct technical replicates for each sample (four replicates in this study) and to construct a single F_C dataset by averaging the fluorescence readings generated by the replicates.

Although conducting LRE analysis on each individual replicate amplification profile and averaging the resulting ΔE, E_max and F₀ values generally produces comparable results, averaging F_C readings from replicate reactions prior to LRE analysis can increase F_C precision substantially, particularly for F_C datasets of marginal quality. An exception is for samples containing < 10 target molecules, due to the fact that Poisson distribution generates extensive scattering of the replicate amplification profiles, such that F_C averaging becomes less effective.

Another key aspect is to monitor run-to-run optical variances by including a quantitative standard within each run. As described in the next section, lambda gDNA has proven to be a reliable quantitative standard that allows monitoring of the many factors impacting assay optics, through the ability to express the fluorescence intensity of an assay in quantitative terms. This introduces the concept of calibrating the optical component of real-time qPCR, which is the last foundational principle of LRE quantification.

Derivation of absolute scale

One of the principal attributes of employing a universal quantitative standard in combination with SYBR Green I detection is the ability to calibrate fluorescence intensity by expressing assay fluorescence in terms of amplicon mass. Referred to as "optical calibration", this can be accomplished by amplifying a known quantity of a standard and dividing the resulting F₀ value by the predicted target quantity, expressed as nanograms of amplicon DNA (M₀). For lambda gDNA, this takes the form:

M_{0} = \frac{n g L a m b d a \times A_{S}}{48, 502}

(8)

where ngLambda is the mass of lambda gDNA in nanograms, A_Sis the amplicon size and 48,502 is the genome size of lambda (both expressed in base pairs) so that:

O C F = \frac{F_{0}}{M_{0}}

(9)

where OCF is defined as the "optical calibration factor", expressed in this study as fluorescence units per nanogram of double-stranded DNA (FU/ng dsDNA). The ability to express fluorescence intensity in quantitative terms provides a simple, centralized quality control component that incorporates all of the many factors impacting the optics of an assay. These include factors related to reaction setup, such as batch-to-batch variations in enzyme formulation, reaction vessels and closures, in addition to the performance of the optical system.

An illustration of this approach is presented in Figure 6, which summarizes optical calibrations conducted with five reaction formulations supplemented with various quantities of SYBR Green I. Derived from the analyses of 32 individual amplification runs, this large dataset provides a general indication of the variances generated by each LRE parameter, culminating in an OCF average CV of ± 21.3% across all five reaction formulations. All things being equal, this should be indicative of the resolution that a LRE-based quantitative assay can achieve. Indeed, similar variances were generated by repeated quantifications of eleven mRNA targets, as described in the next section. Of greater significance, however, is the ability to use optical calibration to establish an absolute quantitative scale.

The concept of optical calibration was first introduced within the context of standard curve-based quantification, in which it was recognized that correlating reaction fluorescence to DNA mass could provide a simplified method for establishing quantitative scale [9]. Optical calibration in a form similar to that presented here was subsequently described within the context of absolute quantification using nonlinear regression (SCF) [18]. In either case, it was evident that the ease of conducting absolute qPCR would be greatly increased by implementing optical calibration using a universal quantitative standard. In addition to eliminating the need to prepare target-specific standards, when combined with LRE analysis optical calibration circumvents the need to construct standard curves.

Under this approach, absolute quantification is achieved by first converting the derived F₀ value into DNA mass using a lambda-based optical calibration factor (equations 8 and 9):

M_{0} = \frac{F_{0}}{O C F}

(10)

for double-stranded DNA targets, or:

M_{0} = \frac{F_{0}}{(O C F \times 0.5)}

(11)

for single-stranded DNA targets, where M₀ is target mass expressed as nanograms of amplicon DNA. Conversion into the number of target molecules then simply requires relating M₀ to the molecular size of the amplicon:

N_{0} = \frac{M_{0} \times 9.1 \times 10^{11}}{A_{S}}

(12)

were N₀ is the number of target molecules, A_Sis amplicon size in base pairs, and 9.1 × 10¹¹ is the number of base pairs per nanogram of dsDNA [18]. In view of the operational simplicity provided by this approach, few additional requirements would be necessary to fully automate absolute quantification. However, one important caveat remains to be addressed, which is the implicit assumption that all amplicons generate similar fluorescent intensities.

Application to gene expression profiling

Key to utilizing a single absolute scale across multiple targets is the underlying assumption that amplicon-specific factors such as amplicon size and/or base pair composition, do not significantly impact the intensity of SYBR Green I fluorescence (i.e. FU/bp). One approach to testing this assumption would be to compare quantifications generated by a large number of diverse amplicons targeted to a quantified standard (such as lambda gDNA), with the expectation that differences in optical intensity would generate quantitative biases. Although such an analytic approach can be effective, the approach chosen for this study was based upon applying LRE quantification to gene expression analysis. An in-house initiative to develop large-scale expression profiling in Arabidopsis thaliana provided a large database from which to select candidate targets for absolute quantification. Eleven transcripts were selected based primarily on encompassing a quantitative range indicative of transcriptional factors, which for this group of targets was estimated to be 10–10,000 transcript molecules per 10 ng of total RNA. Furthermore, these targets encompass amplicon sizes of 85–150 bp and amplicon GC contents ranging from 40–53% (see Methods for further details), from which it was expected that any substantive differences in amplicon fluorescence would generate recognizable quantitative biases.

The most notable initial outcome was that all amplicons generated amplification profiles that conformed well to the LRE model, as reflected by an average LRE linear regression correlation coefficient (r²) of > 0.995 (Figure 7A, additional file 3). Indeed, of the > 400 primer pairs tested to date, all have conformed well to LRE modeling. Unexpectedly, increasing SYBR Green I quantity had little or no impact on the r² of the LRE analysis, despite a greater than X10 increase in F_max at the highest SYBR Green I quantity examined (Figure 7A). Increasing SYBR Green I quantity also did not reduce the quantitative variance of the derived F₀ values (see below).

This high level of precision is further reflected by the low variances generated by the corresponding E_maxand ΔE determinations (Figure 7B and 7C). Worthy of note is the standard deviation of E_maxdetermination, which averaged ± 1.4% across all targets and reaction formulations. In addition to demonstrating the robust nature of LRE analysis, such low variances support the contention that PCR amplification is an inherently precise process. Furthermore, without delving into the mathematics of the LRE model, it should be noted that E_max has by far the greatest impact on target quantification (a 1% difference in E_max roughly produces a 20–30% difference in the calculated target quantity), such that E_maxestimation is the primary determinant of the quantitative resolution that can be achieved. Based upon the low variances in E_maxand ΔE determinations, it is not surprising that the derived F₀ values for each target generated an overall average CV of ± 21.2% (Figure 7D). As noted earlier, this is very similar to the average OCF CV of ± 21.3%, indicating that LRE has the potential to resolve quantitative differences of < ± 25%. Quantitative resolution also appears not to be impacted by increasing SYBR Green I quantity, with each of the five reaction formulations producing similar F₀ CVs (Figure 7D).

This dataset also illustrates the limitations of expressing target quantities in units dependent on assay setup and instrument optics. In this case, it is not possible to directly compare F₀ values across different enzyme formulations, due to differences in reaction intensity and gain settings. However, converting F₀ quantities into the number of target molecules provides an absolute context that allows comparison of target quantities generated under disparate assay conditions. Although this may seem obvious, it is worth stressing that provision of a universal context is what makes absolute quantification so compelling, in that absolute values transcend issues of assay design, instrumentation, and even the type of data analysis applied. Table 1 provides an illustrative example, which demonstrates that SYBR Green I quantity and enzyme type had a modest, if any, impact on LRE quantification. Furthermore, the average CV of ± 23.4% produced across all five reaction formulations is in general agreement with the contention that LRE quantification is able to resolve quantitative differences smaller than ± 25%.

Table 1 LRE-based absolute quantification of eleven cDNA targets using five enzyme formulations.

Full size table

Notwithstanding the high level of precision that can be achieved, it is important to note that despite expressing quantities as the number of target molecules, the quantitative context is still confined to the LRE analyses used to generate the dataset. That is, this type of comparison is unable to verify absolute accuracy, in that any biases generated by LRE analysis and/or optical calibration will generate quantitative biases. As is presented in the following two sections, the quantitative context can be expanded by determining how closely target quantities produced by other methods correlate with the LRE-derived quantities.

Absolute quantification via C_t

Positional analysis, as exemplified by the threshold method, has predominated since the introduction of real-time PCR over 15 years ago [35], and is the quantitative methodology upon which all commercial platforms currently rely. Based upon the fractional cycle at which reaction fluorescence reaches a threshold value, the threshold method defines profile position through a common reference point called the threshold cycle or "C_t". Absolute quantification is accomplished via standard curves constructed with target-specific quantified standards [9], which are technically challenging and prone to generating quantitative errors. However, previous recognition that a sigmoidal-derived E_max is analogous to a slope-derived amplification efficiency generated from a standard curve [18] suggests an alternative method for converting C_t values into target molecules, using the exponential equation:

F_{0} = \frac{F_{t}}{{(E_{\max} + 1)}^{C_{t}}}

(13)

were F_tis the fluorescence threshold used to derive C_t, and E_maxis the amplification efficiency derived from LRE analysis. Importantly, this approach allows the application of optical calibration so that C_t-based absolute quantification can be conducted without standard curves.

Table 2 summarizes the target quantities produced by this approach using the C_t values generated from the amplifications summarized in Figure 7 (see additional file 3 for more details). Similar levels of quantitative variance were seen across all five reaction formulations as compared with those produced by LRE quantification (Table 1). Not only does this confirm the high level of quantitative precision that can be achieved with real-time qPCR, it provides the opportunity to further extend the quantitative context by comparing C_t- and LRE-based quantifications.

Table 2 Absolute quantification based upon conversion of C_t to F₀ via E _max and F_t.

Full size table

Figure 8 presents Log₂ plots and numerical summaries comparing LRE and C_t derived quantities of the eleven cDNA targets, for each of the five reaction formulations. The high level of correlation reflected by an average r² of > 0.995 provides strong corroborating evidence for the precision of LRE quantification. An important qualification, however, is that C_t quantification conducted in this fashion is dependent upon both E_max and OCF. As such, any quantitative biases generated by LRE analysis will impact the absolute accuracy of both LRE- and C_t-based quantifications. It should be noted that standard curves are also susceptible to introducing quantitative biases, produced for example, by errors in quantification of the standard and/or preparation of the dilution series. Importantly, this highlights the dilemma associated with any real-time qPCR assay, irrespective of how it is implemented; that is, how to effectively assess the true quantitative accuracy, referred to in this study as "absolute accuracy".

Accuracy is typically determined by repeatedly measuring some traceable reference standard. While this approach could be effective for well-characterized targets, the use of reference standards is impractical for applications involving large numbers of diverse targets. What would be more effective is an alternative quantitative method that would, as much as possible, be free from potential biases generated by real-time qPCR. A superbly effective solution has been provided by Wang and Spadoro [36], which exploits the single-molecule sensitivity inherent to PCR amplification. Founded upon the principles of Poisson distribution, this approach provides an elegant means for absolute quantification that would be familiar to any microbiologist or virologist; that is, the method of "limiting dilution assay".

Absolute quantification via limiting dilution assay (LDA)

The efficacy of LDA derives from its ability to achieve absolute quantification independent of the kinetic and optical principles upon which real-time qPCR is dependent, and does not require a quantitative standard. LDA relies solely on the frequency of reactions that fail to produce an amplification product, utilizing PCR to only determine whether a target molecule is present within an individual aliquot. Furthermore, the assay is not dependent on how the PCR amplification is actually conducted, requiring only that it generate single-molecule sensitivity, and that amplification reactions that generate false positives (e.g. primer dimers) are either absent or can be identified. Thus, an economical SYBR Green I-based LDA could, for example, be used to evaluate the quantitative accuracy of a probe-based assay.

As the name implies, LDA involves diluting the sample to a limit, which in this case are individual target molecules. As dictated by Poisson distribution, when a sample is diluted to a point near to one target molecule per aliquot, a high proportion of aliquots will not contain a target molecule. It is the frequency of these "nil" aliquots that allow the average target quantity to be calculated using the equation:

Average target molecules per aliquot = -Ln(Nil/Total)

where Nil and Total are the number of reactions that fail to produce amplicon DNA and the total number of reactions conducted, respectively [36]. Target quantity is then calculated based on the dilution factor used to reach single a molecule concentration. It is important to note that LDA is self-validating, such that if the sample is under-diluted no nil reactions will be produced, whereas if the sample is over-diluted all reactions will be nil. As a result of this intrinsic self-validation, LDA has in practice proven to be exceptionally reliable.

Table 3 summarizes results from LDAs conducted on all eleven mRNA targets, for which the dilution factor applied was based upon the LRE quantification for each respective target. Comparing the LDA quantifications to the LRE quantifications via Log₂ plots, four of the five reaction formulations generated an r² > 0.99, with the FullVelocity formulation generating an r² of 0.97 (Figure 9). Excluding the FullVelocity dataset, this translates to an absolute difference of < ± 20% on average. Overall, this dataset not only illustrates the level of absolute accuracies that can be achieved with LRE quantification, but also indicates that any quantitative biases generated by differences in amplicon fluorescence intensity are most likely to be small.

Table 3 Absolute quantification via limiting dilution assay.

Full size table

Recognized anomalies impacting LRE quantification

During the development and testing of LRE quantification, a number of factors were identified that either compromise or prohibit effective application of LRE quantification. Referred to here as "anomalies", these actually encompass a broad variety of factors, ranging from primer pair performance to optical precision and data processing of fluorescence readings.

Compromised efficiency of target priming and elongation

A fundamental principle impacting the quantitative accuracy of real-time qPCR is simple in principle, but not necessarily apparent; that is, disrupting target priming and elongation. Best exemplified by single nucleotide polymorphisms (SNPs), base-pair mismatches between a primer and the target can severely disrupt initiation of PCR amplification, but once an amplicon molecule is formed these mismatches are lost, such that amplicon amplification proceeds unhindered. LRE analysis of the resulting amplification profile is thus incapable of detecting target priming anomalies. Indeed, any factor that disrupts the efficiency of target priming and elongation that does not equally reduce the efficiency of amplicon priming and elongation, will necessarily generate an under-estimation of target quantity.

Such a situation can be prevalent for species or genotypes where SNP occurrence is undocumented. A simple but effective solution is to compare quantifications generated by multiple amplicons per target (see Methods for additional details). Disruption of target priming and elongation then becomes evident as a shift to later cycles, of the profile generated by the impacted primer(s). Although not yet formally documented, preliminary work has also indicated that certain PCR inhibitors can also selectively disrupt target priming and elongation, an issue that is fundamentally more difficult to address.

Compromised optical integrity

Factors impacting assay optics can severely compromise LRE quantification. A simple example encountered during an early set of experiments, was a problem eventually traced back to optical variance in the reaction tubes, which produced up to a 40% difference in fluorescence intensity between identical amplification reactions (data not shown). As described earlier, monitoring assay optics via amplification of quantitative standard such as lambda gDNA, has proven effective for identifying such optical anomalies via differences in the resulting OCF values.

The initial processing of fluorescence datasets is another factor that can impact optical precision. It is paramount to ensure that beyond fluorescence background subtraction, no additional modifications are performed on the F_C datasets. The most commonly encountered data manipulation is often referred to as "curve smoothing" in which a running average is applied in order to generate more aesthetically pleasing amplification profiles. Even though the apparent sigmoidal character of the resulting profiles can be improved (i.e. increased linearity of the LRE plot), such modifications significantly distort LRE analysis, the most evident being a large reduction in E_max. Some forms of optical normalization could also be expected to be problematic, although this has not been formally investigated. As a general rule, it is recommended that as few modifications as possible be conducted on F_C datasets, even to the point of conducting background subtraction manually in order to ensure an accurate baseline estimate, as was necessary in this study (see Methods for additional details).

Baseline drift

Another form of F_C data modification attempts to correct one of the most acute anomalies we have recognized to date; that is changes in the background fluorescence. Referred to as "baseline drift", we have found that some primer pairs, in addition to sample-specific factors can produce a progressive increase in fluorescence background. Importantly, some data processing packages attempt to correct for fluorescence drift by adjusting the values of the F_C readings. While this can be somewhat effective in reducing the quantitative inaccuracies produced by baseline drift, it also leads to unintended distortions that remain hidden if the user is unaware of the underlying F_C manipulation (see Methods for additional details). Baseline drift can also be quite subtle and is most effectively recognized by visual examination of raw fluorescence data before subtraction of background fluorescence.

Our investigation into the source of baseline drift using SYBR Green I detection has revealed it to be a complex phenomenon, likely generated by several mechanisms. One clear trend is a strong primer-specific effect (Figure 10). Unfortunately, repeated attempts to identify reliable predictors, such as sequence complementarity or secondary structural elements (e.g. hairpin loops) within the primers, have to date been largely unsuccessful. Unidentified sample-specific factors have also been found to generate extensive baseline drift across multiple primer pairs that normally do not generate baseline drift. Raising the annealing and elongation temperature and/or diluting the sample, as well as re-purifying the RNA before conducting reverse transcription, have been found to reduce baseline drifting in some cases.

Automation

LRE quantification is primarily a matter of data processing. In practice, the only component that requires any significant insight is LRE window selection. If implementation of LRE analysis could be relegated to a computer program, the user would only need to supply the size of the amplicon and an optical calibration factor in order to complete the quantification. A prototypic Java program (additional file 4) supports this contention by automating start cycle selection and optimizing LRE window size using the recursive approach described earlier. This program brings together all of the elements of LRE quantification, providing the opportunity to examine the interrelationships between LRE window selection, the resulting F₀ and N₀ values, and the predicted F_C values generated by the LRE model. This includes the ability to manually adjust the LRE window, to change the associated C_t value by adjusting the fluorescence threshold, and to enter values for the OCF and amplicon size (Figure 11). Entry of an F_C dataset via pasting from clipboard also provides limited data processing capabilities.

Comparison to other automated data processing packages

To further examine the performance capabilities of LRE, the automated LRE quantification provided by the prototypic Java program was compared to two other publicly available automated data processing packages, which also analyze the F_C readings generated by individual amplification reactions. The first called LinReg, has become a commonly used package for determining amplification efficiency without the use of a standard curve, and employs linear regression analysis of the log-linear region present within the lower region of an amplification profile. Based upon the presumption that amplification efficiency is constant within this region, amplification efficiency is determined from the slope of a Log(F_C) vs. cycle plot, with target quantity (F₀) determined by the intercept, that is, when cycle = 0 [12]. The second package called Miner [15] determines amplification efficiency via nonlinear regression using the same exponential model upon which LinReg is based. Miner also generates a C_t value based upon a dynamic fluorescence threshold (F_t). Although Miner does not provide an F₀ value, equation 13 allows a F₀ value to be calculated using the amplification efficiency, F_t and C_t values generated by Miner.

The approach taken for the comparison was to select a single, representative amplification profile for each of the eleven cDNA targets, across all five assay formulations. This provided a total of 55 amplification profiles for analysis, which are provided in additional file 5. Each amplification profile was then subjected to automated analysis by each of the three packages using default values, from which an amplification efficiency estimate and an F₀ value were obtained (additional file 6). The ultimate objective, however, was to assess the absolute quantitative accuracies by comparison to the LDA quantifications presented in Table 3. In order to accomplish this, it was first necessary to conduct individual optical calibrations for each of the three analysis packages, in order to compensate for the biases specific to each package. Most notable is that both LinReg and Miner generated amplification efficiency estimates that were generally lower than that produced by LRE (additional file 6). All of the lambda gDNA amplification profiles (additional file 7) were thus subjected to analysis by each of the three packages, producing an optical calibration factor specific to each analysis package for each of the five assay formulations. F₀ values were then converted to the number of target molecules using the corresponding OCF, as summarized in additional file 6.

Figure 12 provides a summary of the comparison, in which the quantifications produced by each package are compared to the LDA quantification via Log₂ plots. Overall, Java-LRE performed the best, producing an average r² value of 0.984, while Miner and LinReg produced average r² values of 0.942 and 0.922, respectively. Differences in performance become more apparent when the quantitative differences are expressed as the percentage of LDA quantity, as presented in additional file 6. Both Miner and LinReg performed less well, producing 2.4X and 6.1X greater variance, respectively, than Java-LRE (additional file 6).

Discussion

What should be expected from real-time quantitative PCR?

Often for historical reasons expectations for quantitative PCR can differ greatly, and can be influenced as much by personal perspective as by methodological considerations. The plethora of choices currently available for detection chemistry, enzyme formulation, cycling regime and instrumentation provide many compelling examples. It is not uncommon, for example, for widely differing protocols to be applied to seemingly identical applications, with little or no supporting evidence that any single assay design is superior. Exacerbated by a profound lack of standardization, it is not surprising that many reports caution about the limitations of real-time qPCR [28, 37–39].

Evaluating real-time qPCR technologies is further complicated by the fact that performance expectations can be highly context dependent, and that the application of performance standards (if any) can vary as widely as context. Biomedical diagnostics, for example, provide many poignant scenarios in which assay performance supersedes assay design. Without verifiable accuracy and reliability, methodological details can become immaterial. Arguably, gene expression analysis often represents the other extreme where, for reasons of technical simplicity, transcript quantities are frequently expressed as relative differences. In addition to providing a very limited quantitative context, relative quantification provides little or no opportunity to assess quantitative accuracy.

Despite a broad range of performance expectations, it should be evident that absolute qPCR can enhance the efficacy of any quantitative assay, irrespective of context. Not only does absolute quantification impart a universal perspective that facilitates data interpretation, it also allows assay performance to be defined in absolute terms. Furthermore, absolute quantification allows decoupling of target quantification from assay implementation, such that quantitative data generated by disparate assay designs and/or data processing methodologies can be directly compared.

Notwithstanding the apparent utility of absolute quantification, the technical complexity and resources required by current protocols is daunting. Not only has this greatly impeded broad adoption of absolute quantification, the complexities of implementing even the most basic quantitative assay hinders access to real-time qPCR technologies, particularly for casual users. Furthermore, the necessity for constructing target-specific standard curves severely limits both the efficacy and capacity of absolute quantification.

Founded on recognition that PCR amplification is inherently sigmoidal, this study describes methodologies that provide effective, and in some cases simple solutions for conducting absolute quantification without standard curves. Utilizing a kinetic-based approach, LRE analysis can be applied to any SYBR Green I-based assay, with few qualifications other than that the F_C datasets be of reasonable quality. LRE quantification does not rely on user-supplied standards and, if automated data processing is implemented, requires little or no training beyond that required for preparing amplification reactions. Particularly in view of the impact that absolute qPCR could have on a broad range of applications, these attributes alone provide a compelling argument for moving beyond the historical, often dogmatically held concepts that have persisted since the introduction of real-time qPCR. Key to this endeavor is to develop an effective understanding of the fundamental principles of absolute quantification, many of which transcend details of assay design and data analysis methodology.

The two founding principles of absolute quantification

Despite the seemingly complex mix of technologies and methodologies, absolute quantification requires measurement of only two fundamental parameters – amplification kinetics and quantitative scale – regardless of detection chemistry, enzymology or instrumentation. Historically, this has been accomplished by constructing target-specific standard curves, in which amplification efficiency is derived from the slope and quantitative scale is derived from the intercept [9].

Amplification efficiency determination

Assessing amplification kinetics has long been recognized as a major factor impacting qPCR, due to the fact that errors in amplification efficiency determination can lead to large quantitative errors. Nevertheless, early real-time qPCR protocols simplified target quantification by assuming amplification efficiency to be identical for all amplicons and all samples [40]. Indeed, although the slope of a standard curve provides an estimate of amplification efficiency, a similar assumption must still be made; that is that the amplification efficiencies of all samples are identical, or at least similar, to that predicted by a standard curve. Thus, even if a standard curve can be effectively constructed, quantitative accuracy cannot be ensured due to the potential for sample-specific factors to reduce amplification efficiency. This can be a major concern, particularly for samples originating from sources known to contain inhibitory compounds, such as for environmental samples and for many types of biomedical samples. This deficiency alone would be expected to exclude real-time qPCR from a potentially large category of applications, where unidentified quantitative errors cannot be tolerated. LRE analysis provides a fundamental solution to this issue, through the ability to monitor amplification efficiencies within individual amplification reactions.

Notwithstanding the innate limitations of standard curve-based quantification, the implications of amplification efficiency determination extend beyond issues of quantitative accuracy, playing a predominant role in the operational component of real-time qPCR. Amplification kinetics is determined by a combination of reaction setup and cycling regime. Thus, amplification efficiency must be determined for every new amplicon, and re-determined if changes are made to reaction setup and/or cycling regime. As such, difficulties in determining amplification efficiency limit the number of targets and reaction conditions that can be tested. Under this context it becomes apparent that LRE confers operational attributes important to developing reliable, high-capacity qPCR applications, which are beyond what are possible using current technologies. For example, LRE analysis allows performance assessments to be based on amplification of bona fide samples, as opposed to the common practice of using artificial targets such as plasmids or oligos, and with a capacity limited only by the number of amplification reactions that can be run. Of a more fundamental nature is the potential to extend performance assessment beyond amplification efficiency, to what could be termed as "assay robustness". This could allow assay performance to encompass parameters such as resilience to inhibitors and/or response to changes in cycling regimes (e.g. short annealing and elongation times), which are only two among many possible examples.

Derivation of absolute scale via optical calibration

Although it is generally recognized that amplification efficiency can be derived from the slope of a standard curve, little or no attention has been given to the fact that the intercept establishes absolute scale by relating DNA mass to reaction fluorescence intensity [9]. Recognition of this fundamental principle presents a simple solution to the greatest limitation associated with conducting high-capacity absolute quantification using current protocols, which is reliance on target-specific standards. Based on the presumption that SYBR Green I generates similar fluorescence intensity for all amplicons, lambda gDNA can be exploited as a universal quantitative standard for establishing absolute scale, using a simple, standardized protocol referred to as optical calibration [18]. Furthermore, by relegating establishment of quantitative scale to a universal standard, error of scale is restricted to a single, well-defined entity.

Importantly, utilization of optical calibration is not limited to sigmoidal-based quantification. C_t-based quantification can utilize the same strategy if C_t values are converted into F₀. This can be accomplished using the fluorescence threshold in combination with the E_max derived from LRE analysis (equation 13). Additionally, when the fluorescence threshold is not fixed [9], the impact of inter-run differences in C_t values are eliminated, as differences in F_t are compensated for during the conversion of C_t to F₀. Finally, it should be noted that even though the principles of optical calibration described here have been developed using SYBR Green I, Swillens et al. (2004) describe an optical calibration methodology for hydrolysis probes that correlates fluorescence intensity to probe mass [34]. This suggests that probe-based assays could also implement optical calibration for conducting absolute quantification.

Assay validation via limiting dilution assay

For any analytical technique, the ultimate performance benchmarks are accuracy and reliability. The relevancy of this to real-time qPCR is particularly evident in view of the large number of available choices for detection chemistry, enzyme formulation and instrumentation, all of which generates an enormous number of options to choose from. Indeed, many choices are based on the presumption of superior performance, even at the expense of reducing the practicalities of assay implementation and/or of increased cost. However, despite the many claims of superior performance, the paucity of supporting evidence can be striking.

Limiting dilution assay provides a fundamental, potentially universal solution to the dilemma of how to effectively assess quantitative accuracy, through the ability to conduct absolute quantification independent of real-time qPCR. LDA is simple to conduct, is independent of the kinetic and optical parameters upon which real-time qPCR is founded, does not require a quantified standard and is intrinsically self validating. As such, LDA provides potential solutions to the long-standing challenge of effectively determining true differences in assay performance, whether comparing reaction formulation, instrumentation, or as is the case for this study, data processing models. In view of the multitude of choices that currently confounds real-time qPCR, LDA could be instrumental to establishing standards in which absolute accuracy is the hallmark of assay performance.

Conclusion

Founded upon a new paradigm for real-time qPCR, this study introduces several novel concepts and methodologies that extend the fundamental capabilities of absolute quantification. Most notable is the ability to monitor amplification kinetics within individual amplification reactions, providing the capability to reveal sample-specific inhibition that would otherwise generate unidentified quantitative errors. In addition, utilizing lambda gDNA for optical calibration not only eliminates reliance on target-specific standard curves, but as well, contributes to the standardization of real-time qPCR by centralizing the provision of quantitative scale to a single, highly defined, universal quantitative standard. Exploiting limiting dilution assay for absolute quantification provides the capability to independently evaluate absolute accuracy, irrespective of assay methodology, which could also contribute greatly to the standardization of real-time qPCR technologies. In relation to operational issues, LRE provides several attributes that facilitate large-scale absolute quantification, with the potential to extend assay performance to include the general concept of assay robustness. Ultimately, however, the ability to automate LRE quantification is most illustrative of the potential for developing high-capacity applications, reducing the resources required for conducting absolute quantification to little beyond that needed for reaction preparation.

Methods

SigmaPlot (Version 8) was used to generate the plots presented in Figure 1 was derived from an arbitrarily selected F_C dataset amplification profile using the SCF method [18], which produced: k = 1.51, C_1/2 = 21.02, F_max= 10,837, F_b= 0. LRE analysis was conducted with MS Excel using the templates provided in additional file 2. Nonlinear correlation coefficients of predicted fluorescence profiles presented in the prototypic Java program (Figure 11) were calculated over the range of cycles encompassed by the LRE window using the equation:

R^{2} = 1 - \frac{\sum {(F_{C} - F_{P})}^{2}}{\sum {(F_{C} - F_{a v})}^{2}}

where F_C is reaction fluorescence and F_Pis the predicted reaction fluorescence at cycle C, with F_avbeing the average reaction fluorescence generated by the cycles encompassed by the LRE window.

RNA extracted from 6-day-old Arabidopsis seedlings was reverse transcribed using oligo dT and Superscript II (Invitrogen) at a concentration of 100 ng total RNA per μl, using the manufacturer's recommended reaction conditions except that no RNase H treatment was conducted. Following 10X dilution in 10 mM Tris to a final concentration of 10 ng total RNA per μl, aliquots of the reverse transcriptase reaction were stored at -20°C.

Three different enzyme formulations were used in this study: QuantiTect (Qiagen), FullVelocity (Stratagene) and DyNAmo (Finnzymes, distributed by New England BioLabs). SYBR Green I quantity is expressed in units designated by the manufacturer (Invitrogen). SYBR Green I was diluted to the appropriate quantity using ddH₂0 before addition to the PCR master mix just prior to amplification reaction preparation. Based upon the relative increase in F_max as SYBR Green I quantity was increased (Figure 2), the quantity of SYBR Green I in all three commercial enzyme formulations can be estimated to be about 0.1–0.2X. The highest quantity of SYBR Green I tested in this study is thus estimated to be 10X above that typically found in commercial enzyme formulations used for real-time qPCR.

Initiated by selecting an amplicon position near to 3' end of each target transcript using the stop codon as a general landmark, primer design was primarily based upon selecting a primer length that generated a predicted T_m = 70°C using FastPCR (Institute of Biotechnology, University of Helsinki, Finland) with parameters set to 0.05 M KCl, 0.05 M (NH₄)₂SO₄, 0.0025 M MgCl₂ and 500 nM primer concentration. Although amplicon size was restricted to 80–150 bp, no regard was given to spanning introns or to predicted secondary structures within the primers. Four amplicons were generated for each target transcript by pairing two 5' and two 3' primers. In addition to exploiting the high capacity provided by LRE analysis for assessing primer pair performance, this approach allowed quantitative accuracy to be broadly tested by the level of correlation generated from each of the four amplicons. This was particularly effective for identifying single nucleotide polymorphisms which can generate large quantitative errors produced by disrupting primer annealing to the target. Although this may not be a significant factor for highly characterized genomes, this approach can be effective for verifying quantitative accuracy for disparate species and/or new genotypes (see Results for additional details). For this study a single amplicon was selected for each target (Table 4).

Table 4 cDNA targets and PCR primers.

Full size table

Replicate amplification sets consisting of four 5.0 μl reactions in low profile tubes sealed with ultra clear caps (ABgene) were taken from a 25 μl master mix containing the target and 500 nM of each primer. For transcript quantifications, the target consisted of an aliquot of the diluted reverse transcriptase reaction containing the equivalent of 10 ng total RNA per amplification reaction, or for optical calibration, lambda gDNA (New England BioLabs) at a quantity specified in Figure 6, amplified with the lambda primers K7B-K12 (Table 4). All amplifications were conducted with a Mx3000P spectrofluorometric thermal cycler (Stratagene) using a two temperature cycling regime initiated with a 15 min activation at 95°C, followed by 50 cycles of 120 s annealing and elongation at 65°C for QuantiTect and DyNAmo or 70°C for FullVelocity and a 10 s denaturation at 95°C. To increase optical precision, three fluorescent reads were taken at the end of the annealing and elongation step and the average used as an estimate of reaction fluorescence. Specificity of amplification was confirmed by melting curve analysis conducted at the end of each run.

It is important to note that the Stratagene MxPro-Mx3000P v3.00 software used in this study was found to generate anomalous fluorescence background subtraction, as a result of including into the baseline average of 4–6 cycles after amplicon DNA first becomes detectable. This necessitated manual adjustment for each profile, of the region used to estimate baseline fluorescence. Additionally, as illustrated in Figure 10, this version of the MxPro software also attempts to correct for baseline drifting by modifying the value of each F_C readings, a data manipulation process that cannot be disabled. This necessitated manual background subtraction as implemented in the Excel template used for LRE quantification (additional file 2). Note also that Cikos et al. (2007) reported problems with background subtraction with their Mx3000P platform which required them to conduct SCF analysis using raw fluorescence data [31].

Limiting dilution assays were conducted for each transcript target, using a master mix prepared from the reverse transcriptase reaction diluted to generate a predicted target quantity of 0.5–1 molecules per 5.0 μl amplification reaction. Dilutions were prepared in 10 mM Tris using siliconized microfuge tubes, and were based upon LRE quantification of the originating reverse transcriptase reaction. Amplifications were conducted identically to that used for LRE quantification, and the number of nil reactions scored, based upon a lack of an amplicon production. Although production of primer dimers was nearly absent across the eleven targets examined in this study, in rare cases when reactions produced primer dimers during late cycles (> 40 cycles) they were scored as negative reactions. Note that all N₀ quantifications in this study are expressed as the number of target molecules per 10 ng total RNA.

References

Canales RD, Luo Y, Willey JC, Austermiller B, Barbacioru CC, Boysen C, Hunkapiller K, Jensen RV, Knight CR, Lee KY, Ma Y, Maqsodi B, Papallo A, Peters EH, Poulter K, Ruppel PL, Samaha RR, Shi L, Yang W, Zhang L, Goodsaid FM: Evaluation of DNA microarray results with quantitative gene expression platforms. Nat Biotechnol. 2006, 24 (9): 1115-1122. 10.1038/nbt1236.
Article CAS Google Scholar
Wang Y, Barbacioru C, Hyland F, Xiao W, Hunkapiller KL, Blake J, Chan F, Gonzalez C, Zhang L, Samaha RR: Large scale real-time PCR validation on gene expression measurements from two commercial long-oligonucleotide microarrays. BMC Genomics. 2006, 7: 59-10.1186/1471-2164-7-59.
Article Google Scholar
Qin LX, Beyer RP, Hudson FN, Linford NJ, Morris DE, Kerr KF: Evaluation of methods for oligonucleotide array data via quantitative real-time PCR. BMC Bioinformatics. 2006, 7: 23-10.1186/1471-2105-7-23.
Article Google Scholar
Draghici S, Khatri P, Eklund AC, Szallasi Z: Reliability and reproducibility issues in DNA microarray measurements. Trends Genet. 2006, 22 (2): 101-109. 10.1016/j.tig.2005.12.005.
Article CAS Google Scholar
Miller LD, Liu ET: Expression genomics in breast cancer research: microarrays at the crossroads of biology and medicine. Breast Cancer Res. 2007, 9 (2): 206-10.1186/bcr1662.
Article Google Scholar
Buyse M, Loi S, van't Veer L, Viale G, Delorenzi M, Glas AM, d'Assignies MS, Bergh J, Lidereau R, Ellis P, Harris A, Bogaerts J, Therasse P, Floore A, Amakrane M, Piette F, Rutgers E, Sotiriou C, Cardoso F, Piccart MJ: Validation and clinical utility of a 70-gene prognostic signature for women with node-negative breast cancer. J Natl Cancer Inst. 2006, 98 (17): 1183-1192.
Article CAS Google Scholar
Fan C, Oh DS, Wessels L, Weigelt B, Nuyten DA, Nobel AB, van't Veer LJ, Perou CM: Concordance among gene-expression-based predictors for breast cancer. N Engl J Med. 2006, 355 (6): 560-569. 10.1056/NEJMoa052933.
Article CAS Google Scholar
Schramm A, Vandesompele J, Schulte JH, Dreesmann S, Kaderali L, Brors B, Eils R, Speleman F, Eggert A: Translating expression profiling into a clinically feasible test to predict neuroblastoma outcome. Clin Cancer Res. 2007, 13 (5): 1459-1465. 10.1158/1078-0432.CCR-06-2032.
Article CAS Google Scholar
Rutledge RG, Côté C: Mathematics of quantitative kinetic PCR and the application of standard curves. Nucleic Acids Res. 2003, 31 (16): e93-10.1093/nar/gng093.
Article CAS Google Scholar
Gentle A, Anastasopoulos F, McBrien NA: High-resolution semi-quantitative real-time PCR without the use of a standard curve. Biotechniques. 2001, 31 (3): 502, 504-6, 508.
Google Scholar
Liu W, Saint DA: A new quantitative method of real time reverse transcription polymerase chain reaction assay based on simulation of polymerase chain reaction kinetics. Anal Biochem. 2002, 302: 52-59. 10.1006/abio.2001.5530.
Article CAS Google Scholar
Ramakers C, Ruijter JM, Deprez RH, Moorman AF: Assumption-free analysis of quantitative real-time polymerase chain reaction (PCR) data. Neurosci Lett. 2003, 339 (1): 62-66. 10.1016/S0304-3940(02)01423-4.
Article CAS Google Scholar
Peirson SN, Butler JN, Foster RG: Experimental validation of novel and conventional approaches to quantitative real-time PCR data analysis. Nucleic Acids Res. 2003, 31 (14): e73-10.1093/nar/gng073.
Article Google Scholar
Bar T, Stahlberg A, Muszta A, Kubista M: Kinetic Outlier Detection (KOD) in real-time PCR. Nucleic Acids Res. 2003, 31 (17): e105-10.1093/nar/gng106.
Article Google Scholar
Zhao S, Fernald RD: Comprehensive algorithm for quantitative real-time polymerase chain reaction. J Comput Biol. 2005, 12 (8): 1047-1064. 10.1089/cmb.2005.12.1047.
Article CAS Google Scholar
Liu W, Saint DA: Validation of a quantitative method for real time PCR kinetics. Biochem Biophys Res Commun. 2002, 294 (2): 347-353. 10.1016/S0006-291X(02)00478-3.
Article CAS Google Scholar
Tichopad A: Improving quantitative real-time RT-PCR reproducibility by boosting primer-linked amplification efficiency. Biotechnol Lett. 2002, 24: 2053-2056. 10.1023/A:1021319421153.
Article CAS Google Scholar
Rutledge RG: Sigmoidal curve-fitting redefines quantitative real-time PCR with the prospective of developing automated high-throughput applications. Nucleic Acids Res. 2004, 32 (22): e178-10.1093/nar/gnh177.
Article CAS Google Scholar
Alvarez MJ, Vila-Ortiz GJ, Salibe MC, Podhajcer OL, Pitossi FJ: Model based analysis of real-time PCR data from DNA binding dye protocols. BMC Bioinformatics. 2007, 8: 85-10.1186/1471-2105-8-85.
Article CAS Google Scholar
Chervoneva I, Li Y, Iglewicz B, Waldman S, Hyslop T: Relative quantification based on logistic models for individual polymerase chain reactions. Stat Med. 2007, 26 (30): 5596-5611. 10.1002/sim.3127.
Article Google Scholar
Mehra S, Hu WS: A kinetic model of quantitative real-time polymerase chain reaction. Biotechnol Bioeng. 2005, 91 (7): 848-860. 10.1002/bit.20555.
Article CAS Google Scholar
Saha N, Watson LT, Kafadar K, Ramakrishnan N, Onufriev A, Mane S, Vasquez-Robinet C: Validation and estimation of parameters for a general probabilistic model of the PCR process. J Comput Biol. 2007, 14 (1): 97-112. 10.1089/cmb.2006.0123.
Article CAS Google Scholar
Smith MV, Miller CR, Kohn M, Walker NJ, Portier CJ: Absolute estimation of initial concentrations of amplicon in a real-time RT-PCR process. BMC Bioinformatics. 2007, 8: 409-10.1186/1471-2105-8-409.
Article Google Scholar
Gevertz JL, Dunn SM, Roth CM: Mathematical model of real-time PCR kinetics. Biotechnol Bioeng. 2005, 92 (3): 346-355. 10.1002/bit.20617.
Article CAS Google Scholar
Lalam N: Estimation of the reaction efficiency in polymerase chain reaction. J Theor Biol. 2006, 242 (4): 947-953. 10.1016/j.jtbi.2006.06.001.
Article CAS Google Scholar
Lalam N: Statistical inference for quantitative polymerase chain reaction using a hidden markov model: a Bayesian approach. Stat Appl Genet Mol Biol. 2007, 6: Article10-
Google Scholar
Goll R, Olsen T, Cui G, Florholmen J: Evaluation of absolute quantitation by nonlinear regression in probe-based real-time PCR. BMC Bioinformatics. 2006, 7: 107-10.1186/1471-2105-7-107.
Article Google Scholar
Karlen Y, McNair A, Perseguers S, Mazza C, Mermod N: Statistical significance of quantitative PCR. BMC Bioinformatics. 2007, 8: 131-10.1186/1471-2105-8-131.
Article Google Scholar
Roussel Y, Harris A, Lee MH, Wilks M: Novel methods of quantitative real-time PCR data analysis in a murine Helicobacter pylori vaccine model. Vaccine. 2007, 25 (15): 2919-2929. 10.1016/j.vaccine.2006.07.013.
Article CAS Google Scholar
Qiu H, Durand K, Rabinovitch-Chable H, Rigaud M, Gazaille V, Clavere P, Sturtz FG: Gene expression of HIF-1alpha and XRCC4 measured in human samples by real-time RT-PCR using the sigmoidal curve-fitting method. Biotechniques. 2007, 42 (3): 355-362.
Article CAS Google Scholar
Cikos S, Bukovska A, Koppel J: Relative quantification of mRNA: comparison of methods currently used for real-time PCR data analysis. BMC Mol Biol. 2007, 8: 113-10.1186/1471-2199-8-113.
Article Google Scholar
Durtschi JD, Stevenson J, Hymas W, Voelkerding KV: Evaluation of quantification methods for real-time PCR minor groove binding hybridization probe assays. Anal Biochem. 2007, 361 (1): 55-64. 10.1016/j.ab.2006.11.023.
Article CAS Google Scholar
Swillens S, Dessars B, Housni HE: Revisiting the sigmoidal curve fitting applied to quantitative real-time PCR data. Anal Biochem. 2008, 373 (2): 370-376. 10.1016/j.ab.2007.10.019.
Article CAS Google Scholar
Swillens S, Goffard JC, Marechal Y, de Kerchove d'Exaerde A, El Housni H: Instant evaluation of the absolute initial number of cDNA copies from a single real-time PCR curve. Nucleic Acids Res. 2004, 32 (6): e53-10.1093/nar/gnh053.
Article Google Scholar
Higuchi R, Fockler C, Dollinger G, Watson R: Kinetic PCR analysis: real-time monitoring of DNA amplification reactions. Biotechnology. 1993, 11: 1026-1030. 10.1038/nbt0993-1026.
Article CAS Google Scholar
Wang Z, Spadoro J: Determination of target copy number of quantitative standards used in PCR-based diagnostic assays. Gene Quantification. Edited by: Ferré F. 1998, Boston , Birkhäuser, 31-43.
Chapter Google Scholar
Bustin SA, Nolan T: Pitfalls of quantitative real-time reverse-transcription polymerase chain reaction. J Biomol Tech. 2004, 15 (3): 155-166.
Google Scholar
Nolan T, Hands RE, Bustin SA: Quantification of mRNA using real-time RT-PCR. Nat Protoc. 2006, 1 (3): 1559-1582. 10.1038/nprot.2006.236.
Article CAS Google Scholar
Skern R, Frost P, Nilsen F: Relative transcript quantification by quantitative PCR: roughly right or precisely wrong?. BMC Mol Biol. 2005, 6 (1): 10-10.1186/1471-2199-6-10.
Article Google Scholar
Livak KJ, Schmittgen TD: Analysis of relative gene expression data using real-time quantitative PCR and the 2-DDCt method. Methods. 2001, 25: 402-408. 10.1006/meth.2001.1262.
Article CAS Google Scholar

Download references

Acknowledgements

We thank Armand Séguin for his support of the project and for the many stimulating discussions, and Ariane Plourde who recognized the significance and supported the work during the early stages of the project. We are indebted to Michel Cusson, Pierre Fobert, Frédéric Vigneault, Brian Boyle, Mark Andersen, and especially Jeffrey Dean for critically reviewing the manuscript, in addition to Pamela Cheers for editorial assistance. The research was supported in part by a grant to RGR from the National Biotechnology Strategy of Canada.

Author information

Authors and Affiliations

Natural Resources Canada, Canadian Forest Service, 1055 du P.E.P.S, Quebec, Quebec, G1V 4C7, Canada
Robert G Rutledge & Don Stewart

Authors

Robert G Rutledge
View author publications
You can also search for this author in PubMed Google Scholar
Don Stewart
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Robert G Rutledge.

Additional information

Authors' contributions

RGR conceived and led the project, contributed to primer design and data analysis, drafted the manuscript and wrote the Java program. DS conducted all of the real-time qPCR and LDA amplifications and much of the data analysis, in addition to maintaining the materials and primer database required by the large-scale Arabidopsis gene expression project from which the primers used in this study were derived.

Electronic supplementary material

12896_2008_316_MOESM1_ESM.doc

Additional file 1: Derivation of the two sigmoidal functions describing PCR amplification. A Microsoft Word summary of the rearrangements and substitutions used for conversion of the classic Boltzmann four parameter sigmoid function into a form in which PCR amplification can be modeled, based upon amplification dynamics as described by ΔE and E_max(equation 3). (DOC 136 KB)

12896_2008_316_MOESM2_ESM.xls

Additional file 2: LRE data processing templates. The Excel templates used for manual data processing required for optical calibration and cDNA quantification. (XLS 557 KB)

12896_2008_316_MOESM3_ESM.xls

Additional file 3: LRE data summary. An Excel summary of the LRE analysis, including all of the calculations used to derive the number of molecules for eleven cDNA targets amplified using five enzyme formulations. (XLS 142 KB)

12896_2008_316_MOESM4_ESM.jar

Additional file 4: Prototype Java program for conducting automated LRE analysis. Java program that automates LRE quantification; requires the latest Java(TM) SE Runtime Environment available for installation at java.com. (JAR 133 KB)

12896_2008_316_MOESM5_ESM.xls

Additional file 5: Representative amplification profiles for each of the eleven cDNA targets. Representative F_C datasets for the eleven cDNA targets for each of the five assay formulations. (XLS 126 KB)

12896_2008_316_MOESM6_ESM.xls

Additional file 6: Comparison of the Java-LRE quantification with LinReg and Miner. Excel datasheet summaries, which include all of the calculations used in the analysis for each of the three automated data processing packages. (XLS 228 KB)

12896_2008_316_MOESM7_ESM.xls

Additional file 7: Optical calibration amplification profiles. Excel datasheet summaries of the lambda gDNA amplification profiles used for optical calibration of the five assay formulations. (XLS 92 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Authors’ original file for figure 9

Authors’ original file for figure 10

Authors’ original file for figure 11

Authors’ original file for figure 12

Authors’ original file for figure 13

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Rutledge, R.G., Stewart, D. A kinetic-based sigmoidal model for the polymerase chain reaction and its application to high-capacity absolute quantitative real-time PCR. BMC Biotechnol 8, 47 (2008). https://doi.org/10.1186/1472-6750-8-47

A kinetic-based sigmoidal model for the polymerase chain reaction and its application to high-capacity absolute quantitative real-time PCR

Abstract

Background

Results

Conclusion

Background

Results

Amplification efficiency is dynamic and is coupled to amplicon DNA quantity

Conformity of real-time amplification profiles generated with SYBR Green I

Derivation of a kinetic-based sigmoid model for the polymerase chain reaction

Implementation of LRE quantification

Initial setup

Target quantification, recursive analysis and the precision of LRE modeling

Maximizing optical precision

Derivation of absolute scale

Application to gene expression profiling

Absolute quantification via Ct

Absolute quantification via limiting dilution assay (LDA)

Recognized anomalies impacting LRE quantification

Compromised efficiency of target priming and elongation

Compromised optical integrity

Baseline drift

Automation

Comparison to other automated data processing packages

Discussion

What should be expected from real-time quantitative PCR?

The two founding principles of absolute quantification

Amplification efficiency determination

Derivation of absolute scale via optical calibration

Assay validation via limiting dilution assay

Conclusion

Methods

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Authors' contributions

Electronic supplementary material

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Biotechnology

Contact us

Absolute quantification via C_t