Email updates

Keep up to date with the latest news and content from BMC Medical Research Methodology and BioMed Central.

Open Access Research article

Network meta-analysis of survival data with fractional polynomials

Jeroen P Jansen

Author Affiliations

Mapi Values, Boston, USA

BMC Medical Research Methodology 2011, 11:61  doi:10.1186/1471-2288-11-61

The electronic version of this article is the complete one and can be found online at: http://www.biomedcentral.com/1471-2288/11/61


Received:28 November 2010
Accepted:6 May 2011
Published:6 May 2011

© 2011 Jansen; licensee BioMed Central Ltd.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Background

Pairwise meta-analysis, indirect treatment comparisons and network meta-analysis for aggregate level survival data are often based on the reported hazard ratio, which relies on the proportional hazards assumption. This assumption is implausible when hazard functions intersect, and can have a huge impact on decisions based on comparisons of expected survival, such as cost-effectiveness analysis.

Methods

As an alternative to network meta-analysis of survival data in which the treatment effect is represented by the constant hazard ratio, a multi-dimensional treatment effect approach is presented. With fractional polynomials the hazard functions of interventions compared in a randomized controlled trial are modeled, and the difference between the parameters of these fractional polynomials within a trial are synthesized (and indirectly compared) across studies.

Results

The proposed models are illustrated with an analysis of survival data in non-small-cell lung cancer. Fixed and random effects first and second order fractional polynomials were evaluated.

Conclusion

(Network) meta-analysis of survival data with models where the treatment effect is represented with several parameters using fractional polynomials can be more closely fitted to the available data than meta-analysis based on the constant hazard ratio.

Background

Healthcare decision-making requires comparisons of all relevant competing interventions. If the available evidence consists of a network of multiple randomized controlled trials (RCTs) involving treatments compared directly or indirectly or both, it can be synthesized by means of network meta-analysis [1-4]. Network meta-analysis of survival data is often based on the reported hazard ratio, which relies on the proportional hazards assumption.

The proportional hazards assumption that underlies current approaches of evidence synthesis of survival outcomes is not only often implausible, but can have a huge impact on decisions based on cost-effectiveness analysis. In extreme cases survival curves intersect and the hazard ratio is not constant. Furthermore, even if survival functions do not intersect, the hazard functions might and the assumption is violated. For cost-effectiveness evaluations of competing interventions that aim to improve survival, differences in expected survival between the competing interventions are of interest. Common practice is to assume a certain parametric survival function for the baseline intervention (e.g. Weibull) and apply the treatment specific constant hazard ratio obtained with the (network) meta-analysis to calculate a corresponding survival function enabling comparisons of expected survival. Since the tail of the survival function has a great impact on the expected survival, violations of the constant hazard ratio can lead to severely biased estimates. Hence, the proportional hazards assumption has become a source of concern in drug reimbursement based on cost-effectiveness evidence.

As an alternative to a network meta-analysis of survival data in which the treatment effect is represented by a single parameter, i.e. the hazard ratio, a multi-dimensional treatment effect approach is presented. With fractional polynomials the hazard over time is modeled by which the treatment effect is represented with multiple parameters [5]. With this approach a network meta-analysis of survival can be performed with models that can be fitted more closely to the data. With these parametric hazard functions, expected survival can be calculated to facilitate cost-effectiveness analysis. The method is illustrated with an example.

Methods

Fractional polynomials and the hazard function

Royston and Altman introduced fractional polynomials as an extension of polynomial models for determining the functional form of a continuous predictor [5]. These models are well suited for nonlinear data. In contrast to categorizing continuous predictors, the analysis is no longer dependent on the number and choice of cut points [6]. Fractional polynomials have been used in many applications including survival and meta-regression analysis [7-9].

By transforming t, a continuous variable, in a linear model the first-order fractional polynomial model is obtained:

(1)

The power p is chosen from the following set: -2. -1, -0.5, 0, 0.5, 1, 2, 3 with t0 = log t

The second order fractional polynomial is defined as:

(2)

If p1 = p2 = p the model becomes a 'repeated powers' model:

(3)

Royston and Altman showed that by varying p1 and p2 and the parameters β0, β1 and β2 a wide range of curve shapes can be obtained [5,6,8,10,11].

The first order fractional polynomial for the hazard at time t of a two arm treatment B versus A randomized controlled trial can be presented as follows:

(4)

where: hkt reflect the hazard with treatment k at time t. The vector reflects the parameters β0 and β1 of the 'baseline' treatment A, whereas the vector reflects the difference in β0 and β1 of the log hazard curve for treatment B relative to A. The parameter d0 corresponds to the treatment effect with a proportional hazard model. Under the proportional hazards assumption d1 equals 0. If d1 ≠ 0, d1 reflects the change in the log hazard ratio over time. Hence, by incorporating d1 in addition to d0 a multi-dimensional relative treatment effect is used rather than single parameter for the relative treatment effect.

Hazard functions can have different shapes, including a constant hazard over time, a linear increasing or decreasing hazard over time, and bathtub shaped. If in equation 4 β1 equals 0, a constant log hazard function is obtained, reflecting exponentially distributed survival times. If β1 ≠ 0 and p = 1 a linear hazard function is obtained which corresponds to a Gompertz survival function. If β1 ≠ 0 and p = 0 a Weibull hazard function is obtained, and reflects the difference in respectively the scale and shape of the Weibull log hazard curve for treatment B relative to A. Extending the first-order fractional polynomial hazard function to a second-order fractional polynomial increases the possible (differences in) shapes even further. Hence, modeling the hazard function of competing interventions with fractional polynomials provides a general framework that includes some of the commonly used parametric survival functions and does not rely on the constant hazard ratio assumption.

Network meta-analysis model for survival data using fractional polynomials

Network meta-analysis has been presented as an extension of traditional meta-analysis by including multiple different pairwise comparisons across a range of different interventions. Meta-analysis models for the comparison of treatment B versus A can be extended to models allowing simultaneous comparisons of B versus A as well as C versus A [1-4]. To appreciate the randomization of the different studies in the evidence synthesis, a study of a certain pairwise comparison has to be 'linked' to any of the other studies in the network. When the network consists of AB-trials, AC-trials, as well as BC trials, we have a mixture of direct and indirect comparisons and these analyses have been called mixed treatment comparisons (MTC) [3].

For a network meta-analysis, the similarity and consistency relation needs to hold regarding the estimated model parameters [3,12,13]. If AB trials and AC trials are comparable on effect modifiers (i.e. covariates that affect the relative treatment effect), then an indirect estimate for the relative effect of C versus B (dBC) can be obtained from the estimates of the effect of B versus A (dAB) and the effect of C versus A (dAC): dBC = dAC - dAB. In essence, this implies that the same dBC is obtained as would have been estimated in a three arm randomized ABC trial. In general, for a model described by the function fx(t) where x = A, B, or C, we have: (fC(t)- fB(t)) = (fC(t)- fA(t))-(fB(t)- fA(t)). For a network meta-analysis of survival data, the comparison can be performed on the log hazard ratio, and this relation needs to apply to every timepoint t: ln(HRBC(t))- ln(HRAC(t))- ln(HRAB(t)) with HRBC(t) reflecting the hazard ratio of C relative to B at time t. Based on equation 4 it follows that:

(5)

Hence, the differences in the model parameters β0 and β1 of the first order fractional polynomials are independent of time. Furthermore, according to equation 5 the difference in β0 and β1 of the BC comparison can be described by the difference in these parameters for the AC comparison and AB comparison. Given this relation, a network meta-analysis can be performed based on the differences in β0 and β1 of log hazard curves across studies. Similarly, the transitivity assumption holds for fractional polynomials of any order.

Using a similar notation as Cooper et al. [13], the random effects model for a network meta-analysis of survival data based on a fractional polynomial of order M for k treatments labeled A, B, C, etc can be described as:

(6)

where hjkt reflects the underlying hazard rate in trial j for intervention k at time point t. The vectors are trial-specific and reflect the parameters β0, β1,..., βM of the comparator treatment, whereas the vectors reflect the study specific difference in β0, β1,..., βM of the log hazard curve for treatment k relative to comparator treatment b. and are drawn from a multivariate normal distribution with the pooled estimates expressed in terms of the overall reference treatment A with . For example, , , etc. Σ is the between study covariance matrix to reflect heterogeneity which is assumed constant for all treatment comparisons where σm represent the variance for δmjbk (i.e. the difference in βm) and ρ01, ρ02,..., ρM-1,Mis the correlation between these parameters. Of key interest from the analyses are the pooled estimates of dmAk and estimates for the heterogeneity. Please note that the HR is changing over time once dm≥1 is different from 0.

Under a fixed effects model the multivariate normal distribution with the pooled estimates will be replaced with and as a result the between study covariance matrix does not need to be estimated. When only for d0Ak heterogeneity is assumed, and the other effect parameters d1Ak,..., dMAk are fixed, then is replaced with δ0jbk~Normal(d0Ak-d0Ab, σ2) and .

A random effects model with only a heterogeneity parameter for d0Ak implies that the between study variance of the log hazard ratios remains constant over time. Random effects models with (additional) heterogeneity parameters for d1Ak,..., dMAk have the flexibility to capture between study variance regarding changes in the log hazard ratios over time.

The random effects fractional polynomial model in equation 6 treats multiple-arm trials (>2 treatments) without taking account of the correlations between the trial-specific δs that they estimate. Bayesian random effects fractional polynomials models with only a heterogeneity parameter for d0Ak can be easily extended to fit trials with 3 or more treatment arms by decomposition of a multivariate normal distribution as a series of conditional univariate distributions [13]. If then the conditional univariate distributions for arm i given the previous 1,....(i-1) arms are:

Different values for the powers p1 and p2 of the fractional polynomials correspond to different models. The best fitting model can be selected based on goodness-of-fit comparisons. The goodness of fit can be computed as the difference between the deviance for the fitted model and the deviance for the saturated model (which fits the data perfectly). Within a frequentist framework the Akaike information criterion (AIC) can be used for model selection [14]. In a Bayesian framework the Bayesian information criterion (BIC) or deviance information criterion (DIC) can be used [15,16].

Illustrative example

To understand how the analytical approach proposed can be applied in practice, an example is presented for oncology where trials are typically focused on overall (and progression free) survival.

Lung cancer is a leading cause of cancer mortality in both men as well as women, with non-small cell lung carcinoma (NSCLC) accounting for 80% of all cases [17]. Second line treatment for advanced NSCLC includes docetaxel and pemetrexed [18]. Gefitinib has been studied as second line treatment as well.

A literature search identified seven RCTs comparing docetaxel with best-supportive care (1 study), gefitinib with best-supportive care (1 study), docetaxel with gefitinib (4 studies), and docetaxel with pemetrexed (1 study) [19-25]. The network of RCTs is presented in Figure 1 and shows that for the comparisons of BSC, docetaxel and gefitinib both direct and indirect evidence is available. For each treatment arm in each study reported Kaplan-Meier curves were digitized (Engauge Digitaliser v4.1) In Figure 2 the scanned survival proportions are presented. This aggregate data was analyzed with fractional polynomial network meta-analysis models.

thumbnailFigure 1. Network of randomized controlled trials.

thumbnailFigure 2. Survival as observed in individual studies.

Whilst network meta-analysis can be performed with a frequentist or a Bayesian approach, for this manuscript the focus is on the Bayesian approach. Within the Bayesian framework, analyses consist of data, likelihood, parameters, a model, and prior distributions. More specifically, Bayesian analysis involves the formal combination of a prior probability distribution that reflects a prior belief of the possible values of the parameters of the model with a likelihood distribution of the model parameters based on the observed data in the different studies to obtain a posterior probability distribution of these [26-28].

The scanned survival curves can be divided into multiple consecutive intervals over the follow-up period. Extracted survival proportions were used to calculate the incident number of deaths for each interval and patients at risk at the beginning of that interval. A binomial likelihood distribution of the incident number of deaths for every interval [t,t+Δt] (Δt is the time from t to t+1) of the Kaplan-Meier curves can be described according to:

(7)

Where rjkt is the observed number of incident deaths in the interval [t,t+Δt] for study j and treatment k. njkt Is the number of subjects alive at t, adjusted for the subjects censored in the interval [t,t+Δt]. pjkt is the observed cumulative incidence of deaths in the interval [t,t+Δt]. In the appendix more detail is provided how a dataset for njkt and rjkt can be obtained from the Kaplan-Meier curve taking into account censoring in the interval [t,t+Δt]. In Table 1 the incident deaths and patients at risk for every 2-month period of the individual studies are presented. When the time interval is relatively short, the hazard rate can be assumed constant within the time interval, and the hazard rate hjkt is:

Table 1. Number of deaths and patients at risk for every consecutive 2-month period as extracted from digitized survival curves (see appendix for details)

(8)

In this example fixed and random effects first and second order fractional polynomial models were used with powers chosen from the following set: -2. -1, -0.5, 0, 0.5, 1, 2, 3 with t0 = log t according to eq. 6. Two different random effects second order fractional polynomial models were compared: one model with a heterogeneity parameter for d0, and one model with heterogeneity parameters for all three treatment parameters (d0, d1 or d2). Although random effects models with a heterogeneity parameter for only d1 or d2 can be estimated as well, these were considered less appropriate because these models assume that heterogeneity in treatment effects only develop over time, and is not present at treatment initiation. In other words: heterogeneity is only a function of time, and not (also) a function of differences in patient characteristics across studies. If only one heterogeneity parameter is (to be) used, it should be for d0 because it assumes constant variance for the complete follow-up period.

The non-informative prior distributions as used for the parameters of the random effects second-order fractional polynomial model with heterogeneity corresponding to d0, d1 and d2 are presented (according to equation 6):

(9)

For a first order fractional polynomial model these 3-dimensional multivariate prior distributions are reduced to bivariate normal distributions. With a random effects model, where only for d0 a heterogeneity parameter is used, the corresponding prior distribution can be defined as σ ~ uniform(0,2). When all relative effects parameters are assumed fixed, there is no heterogeneity to be estimated, and no such prior distribution needs to be defined.

The parameters of the different models were estimated using a Markov Chain Monte Carlo (MCMC) method as implemented in the WinBUGS software package [29]. (See appendix for the code.) The WinBUGs sampler, using two chains, was run for 30000 iterations for the models and these were discarded as 'burn-in' and the model was run for a further 50 000 iterations on which inferences were based. Convergence of the chains was confirmed by the Gelman-Rubin statistic.

The DIC was used to compare the goodness-of-fit of different fixed and random effects models with first and second order fractional polynomials with different powers. DIC provides a measure of model fit that penalizes model complexity according to [16]. is the posterior mean residual deviance [15], pD is the 'effective number of parameters' and is the deviance evaluated at the posterior mean of the model parameters. The model with the lowest DIC, is the model providing the 'best' fit to the data. For every combination of p1 and p2 the DIC was determined. The powers p1 and p2 corresponding to the best fitted fixed effects models were also used to evaluate corresponding random effects models.

Results

Illustrative example

The model fit statistics for the different models are presented in Table 2. The fixed effects Weibull model (p1 = 0) was one of the worst regarding goodness-of-fit. Of the first order fractional polynomial models, the model with power p1 = -2 was the best fit. Adding a second time- related effect to this first order fractional polynomial model dramatically improved the model fit. Although the model with p1 = -2 and p2 = 1 has the lowest DIC of all the fixed effects models evaluated, the model with p1 = -2 and p2 = 2 and the model with p1 = -2 and p2 = 3 deserve consideration as well because these are within 1-2 points of the "best" model [16]. However, the modeled hazard function with p2 = 1 is not as sensitive to small sample fluctuations near the end of the follow-up of each study as the models with p2 = 2 or p2 = 3. To facilitate the extrapolation of the survival curves beyond the trial period, the model with p1= -2 and p2 = 1 was considered the most appropriate fixed effects model. The corresponding random effects models showed similar values for the DIC, and as such the random effects models were considered more appropriate. The model with a heterogeneity parameter for d0 only showed more stable parameter estimates than the random effects model with heterogeneity parameters for d0, d1 and d2. Given the similar fit of these random effect models, the model with one heterogeneity parameter was used.

Table 2. Goodness-of-fit estimates for fixed and random effects fractional polynomial models for different powers p1 and p2.

Table 3 provides parameter estimates for the fixed effects first and second order fractional polynomial models with p1 = -2 and p2 = 1, as well as the corresponding random effects model with a heterogeneity parameter for d0 . Based on the pooled relative treatment effects regarding β0, β1 and β2 of each intervention relative to docetaxel (d0Ak, d1Ak, and d2Ak with k = B,C,D corresponding to respectively gefitinib, BSC, and pemetrexed) the corresponding hazards ratios as a function of time were obtained: ln(HRAk) = d0Ak + d1Ak · t-2 + d2Ak · t. The hazard ratios over time obtained with the random effects model are presented in Figure 3. It is obvious that the assumption of constant hazards ratio does not apply to any comparison with BSC involved. Although for the comparison of gefitinib relative to docetaxel a constant hazard ratio over time might be defended, the additional indirect evidence via BSC for this comparison clearly does not allow this assumption. Based on this observation, one can argue that d1 and d2 for gefitinib and pemetrexed relative to docetaxel can be set to zero, and that d1 and d2 only need to be estimated for BSC versus docetaxel. However, it has to be realized that by making that assumption the uncertainty regarding the proportional hazards assumption for gefitinib and pemetrexed is no longer taken into consideration.

Table 3. Model parameter estimates for different fractional polynomial network meta-analysis models.

thumbnailFigure 3. Hazard ratio over time for each of the interventions relative to docetaxel as obtained with random effects second order fractional polynomial (p1 = -2, p2 = 1) network meta-analysis model. (Corresponding parameter estimates are presented in Table 3: d0Ak, d1Ak, d2Ak)

In the example there is both direct evidence (i.e. head-to-head studies) and indirect evidence (via BSC) for the comparison of gefitinib versus docetaxel. As such, the network meta-analysis combining both direct and indirect comparisons uses more information than a pairwise meta-analysis of the 4 gefitinib versus docetaxel studies. In Figure 4, the hazard ratio over time is presented for the pairwise meta-analysis of gefitinib versus docetaxel based on 4 studies, as well as the mixed treatment comparison. The estimates of the two analyses are comparable (at least from month 3 onwards) suggesting that inconsistency between direct and indirect estimates is not an issue of concern. However, the uncertainty of the hazard ratio over time is greater with the pairwise meta-analysis of 4 studies than the network meta-analysis of 6 studies. By incorporating indirect evidence the parameters of the fractional polynomial can be estimated more precisely in this example.

thumbnailFigure 4. Estimation of hazard ratio over time for gefitinib versus docetaxel as obtained with mixed treatment comparison model (4 gefitinib-docetaxel studies, 1 BSC-docetaxel study, 1 gefitinib-BSC study) is associated with less uncertainty than obtained with a meta-analysis model (4 Gefitinib-Docetaxel studies).

By using the average of study specific estimates for β0, β1 and β2 with docetaxel as the reference, the expected β0, β1 and β2 for the other interventions were calculated using the relative treatment effects d0Ak, d1Ak, and d2Ak. (See Table 4) The corresponding hazard and survival functions for each of the four interventions are presented in Figure 5 and 6A. With these parametric survival curves it is now possible to calculate the expected survival (i.e. the area under the curve) which is presented in Table 4 as well.

Table 4. Functions of parameter estimates for different fractional polynomials

thumbnailFigure 5. Hazard over time for each of the interventions as obtained with random effects second order fractional polynomial (p1 = -2, p2 = 1) network meta-analysis model. Docetaxel hazard curve used as 'anchor'.

thumbnailFigure 6. Survival over time for each of the interventions as obtained with A) random effects second order fractional polynomial (p1 = -2, p2 = 1) network meta-analysis model; and B) random effects proportional hazards model assuming Weibull distribution.

When, as is common practice for cost-effectiveness analysis, a constant hazards ratio in combination with a Weibull distribution was assumed, the DIC of the model was 959.1. The fitted survival curves for docetaxel, gefitinib, BSC, and pemetrexed are presented in Figure 6B. The expected survival was respectively 15.1, 14.5, 8.0, and 15.2 months, and shows the overestimate relative to the random effects second order fractional polynomial model. The greatest difference is observed for the BSC survival curve, and the tails of the active interventions. To illustrate that the fractional polynomials produce a visibly better fit to the data than a simple model like the Weibull with a proportional hazards assumption, these models are presented for 3 studies in Figure 7. For the other 4 studies, the difference between the fractional polynomial curves and Weibull curves was not as great.

thumbnailFigure 7. Three representative studies that illustrate that a constant hazard ratio in combination with a Weibull reference curve does not fit the data as closely as the fractional polynomial models.

Discussion

In this paper a method for (network) meta-analysis of survival data using a multi-dimensional treatment effect is presented as an alternative to synthesis of the constant hazards ratio. With first or second order fractional polynomials the hazard functions of the interventions compared in a trial are modeled and the difference in the parameters of these fractional polynomials within a trial are considered the multidimensional treatment effect and synthesized (and indirectly compared) across studies. In essence, with this approach the treatment effects are represented with multiple parameters rather than a single parameter or outcome.

Meta-analysis of survival data using the constant hazards ratio can be considered a special case of the model presented here. When in equation 6 d1Ak, d2Ak, ...dMAk equal 0, only the time independent parameters β0jk can be different across treatments within a trial and accordingly d0Ak reflect the constant log hazard ratio of treatment k relative to A. (Please note that the baseline hazard can still be modelled with multiple β1jk, β2jk, ..., βMjk that can be different from 0, but these are constant across all interventions within a trial. With a Cox proportional hazards model the baseline hazard is unconstrained and not described by parametric distribution or function.) The advantage of the approach presented here is that it does not rely on the proportional hazards assumption and as a result the model used can be more closely fitted to available survival data. In a situation, where the violation of the proportional hazard ratio is less clear due to limitations of the data, it still can be considered useful modeling a multi-dimensional treatment effect to express the uncertainty in the violation of the assumption of proportional hazards.

For network meta-analysis it is important that for the relative effect measure of interest the transitivity assumption holds [3,12,13]. Although the transitivity assumption holds for the constant (log) hazards ratio, violations of the proportional hazards assumption within or across trials, can result in biased indirect and mixed treatment comparisons of relative survival over time. By incorporating additional parameters for the treatment effect, the proportional hazards assumption is relaxed and therefore indirect and mixed treatment comparisons are arguably less likely to result in biased indirect estimates.

With a (network) meta-analysis the value of randomization only holds within a trial, and not across trials [3,12,13]. In other words, patients are randomly assigned to treatments within a trial, but patients are not randomly assigned to different trials. As a result there is the risk that patients assigned to the different trials are not comparable. If the distribution of patient and study level characteristics that modify the relative treatment effects is not similar across trials indirectly compared results will be affected by confounding bias [13]. In the models presented in this paper, treatment effect estimates will be biased if there is an imbalance in the distribution of treatment*covariate interactions across studies regarding the multidimensional treatment effect. Hence, it is suggested to expand the current models by incorporating treatment*covariate interactions. An additional advantage is that it can explain heterogeneity and facilitates the prediction of expected survival for subgroups [13].

In the example analysis, aggregate level data, i.e. scanned Kaplan-Meier curves, were used for all interventions compared. However, the models can also be used in combination with individual patient level data, using a different likelihood. Patient-level analyses have the advantage that no (conservative) assumption has to be made regarding the censoring process. Furthermore, patient-level network meta-analyses have greater power to estimate meta-regression models thereby reducing inconsistency and providing the opportunity to explore differences in effect among subgroups. However, obtaining patient-level data for all RCTs in the network may be considered infeasible. As an alternative one could use patient-level data when available, and aggregate level data for studies in the network for which such data is not available thereby improving parameter estimation over aggregate-data-only models.

Drug coverage decision-making is often informed by cost-effectiveness analysis where expected costs and expected outcomes are compared. When the main objective of the competing interventions is to improve survival, the primary outcome of interest is expected survival or for-quality-of-life adjusted expected survival. Unfortunately, given the available follow-up in the clinical trials, survival data is often censored and therefore the expected survival cannot be obtained without extrapolation of the data over time. Standard practice is to extrapolate the available survival data for the reference treatment using a parametric survival function (e.g. Weibull, lognormal or log-logistic). This baseline hazard function is multiplied with the constant hazard ratio for each of the competing interventions relative to this baseline to obtain hazard functions for the interventions of interest. The assumption of a constant hazards function implies that only the scale of these parametric functions is affected by treatment, and accordingly all the competing interventions have the same shape. Since the tail of the survival function has a great impact on the expected survival this assumption may lead to biased or at least highly uncertain estimates regarding differences in expected survival and therefore cost-effectiveness estimates. Given the multi-dimensional treatment effect of the approach presented in this paper, the parametric hazards functions of the competing interventions can be different regarding all of their parameters. As a result the extrapolated survival functions for all the interventions are more closely fitted to the available data and expected survival is less likely to be over or underestimated. An additional advantage of the use of fractional polynomials is that models can be fitted that go to asymptotes, and are therefore far more stable at the ends than, say, standard polynomials or splines. Although the proposed models constitute a substantial liberalization for evidence synthesis of survival curves from RCTs, there is still a danger of under-stating the uncertainty in extrapolating the curves because the choice of fractional polynomials is based on model fit criteria. In order to reflect model uncertainty, it might be of interest to estimate the powers of the fractional polynomials as well.

Conclusions

(Network) meta-analysis of survival data is commonly performed with models represented with one parameter for the relative treatment effect: the constant hazard ratio. When the proportional hazards assumption does not hold, models in which the treatment effect is represented by several parameters using fractional polynomials can be more closely fitted to the available data. The models allow straightforward estimation of expected survival to facilitate cost-effectiveness analysis.

Abbreviations

AIC: Akaike information criterion; BSC: best-supportive care; DIC: deviance information criterion; NSCLC: non-small cell lung carcinoma; RCT: randomized controlled trial

Competing interests

The author declares that they have no competing interests.

Authors' contributions

JJ is responsible for the development of the concept and methods, analysis of the example and writing of the manuscript

Appendix

Extraction of data from survival curves to use in the network meta-analysis model

According to the Kaplan-Meier curve, the proportion of people alive at time point t St that die between time point t and time point t + 1 is equal to (St - St+1)/St and can be described by binomial likelihood distribution: rt ~ bin(pt, nt). Where is the number of deaths rt in the interval [t,t+1]. nt is the number of subjects at risk in that interval, and pt is the underlying risk.

In the absence of censoring for the interval [t,t+1], nt is the number at risk at the beginning of the interval and rt can be obtained by multiplying nt with (St - St+1)/St.

The number at risk for a particular interval might be provided below the Kaplan-Meier graph; if not reported, it can be obtained according to starting at the time point where nt is provided below the graph.

In the case of censoring, the overlap of the sequence of censoring and deaths within the time interval [t,t+1] is unclear, and it is not possible to derive the exact number of deaths and censoring in the interval. As extreme cases we can assume that, on the one hand, censoring occurs after the deaths within the interval, or, on the other hand, all censoring occurs before the deaths. In the first scenario nt is the number at risk at the beginning of the interval, whereas in the second scenario nt is the number at risk at the beginning of the interval minus the number of censored subjects. With the second scenario it is clear that nt and rt are smaller given (St - St+1)/St resulting in more uncertainty regarding the estimate pt. To not underestimate the uncertainty we opted for the second scenario. Under the assumption that all censoring occurs before the deaths occur, nt can again be obtained by with nt+1 reported below the graph, or based on the same calculation for the interval [t+1, t+2], etc.

Winbugs code for second order fractional polynomial random effects network meta-analysis model

Model{

for (i in 1:N){         # N number of datapoints in dataset

# time is expressed in months and transformed according powers of fractional polynomial P1 and P2

time_transf1[i]<-(equals(P1,0)*log(time[i]) + (1-equals(P1,0))*pow(time[i],P1))

time_transf2[i]<-((1-equals(P2,P1))*(equals(P2,0)*log(time[i]) + (1-equals(P2,0))*pow(time[i],P2)) + equals(P2,P1)*(equals(P2,0)*log(time[i])*log(time[i]) + (1-equals(P2,0))*pow(time[i],P2) *log(time[i])))

# likelihood

# hazard over interval [t,t+dt] expressed as deaths per person-month

# r is deaths in interval, n is number at risk, h is hazard

r[i]~ dbin(p[i],n[i])

p[i]<-1-exp(-h[i]*dt) # cumulative hazard over interval [t,t+dt] expressed as deaths per person-month

# random effects model

# loop over datapoints

# s refers to study, k is intervention k, b is comparator

log(h[i])<-Beta[i,1]+ Beta[i,2]*time_transf1[i]+ Beta[i,3]* time_transf2[i]

Beta[i,1]<-mu[s[i],1]+delta[s[i],1]*(1-equals(k[i],b[i]))

Beta[i,2]<-mu[s[i],2]+delta[s[i],2]*(1-equals(k[i],b[i]))

Beta[i,3]<-mu[s[i],3]+delta[s[i],3]*(1-equals(k[i],b[i]))

}

# loop over studies

# NS is number of studies

# ks is intervention k, bs is comparator

for(m in 1:NS){

delta[m,1:3]~dmnorm(md[k,1:3],omega[1:3,1:3])

md[m,1]<-d[ks[m],1]-d[bs[m],1]

md[m,2]<-d[ks[m],2]-d[bs[m],2]

md[m,3]<-d[ks[m],3]-d[bs[m],3]

}

# priors

# NT is number of treatments

d[1,1]<-0

d[1,2]<-0

d[1,3]<-0

for(j in 2:NT){

d[j,1:3] ~ dmnorm(mean[1:3],prec2[,])

}

for(k in 1:NS){

mu[k,1:3] ~ dmnorm(mean[1:3],prec2[,])

}

omega[1:3, 1:3] ~ dwish(R[1:3,1:3],3)

# output SD and correlation based on estimated covariance matrix

sigma.theta[1:3,1:3] <- inverse(omega[1:3,1:3])

rho[1,2] <-sigma.theta[1,2]/sqrt(sigma.theta[1,1]*sigma.theta[2,2])

rho[1,3] <-sigma.theta[1,3]/sqrt(sigma.theta[1,1]*sigma.theta[3,3])

rho[3] <-sigma.theta[3]/sqrt(sigma.theta[2,2]*sigma.theta[3,3])

sd[1]<-sqrt(sigma.theta[1,1])

sd[2]<-sqrt(sigma.theta[2,2])

sd[3]<-sqrt(sigma.theta[3,3])

# output hazard ratio for month 1 to 60

# NT is number of treatments, c is reference treatment, k is treatment of interest, l is month

for (c in 1:(NT-1)) {

for (j in (c+1):NT) {

for (l in 1:60) {

t1[l]<-(equals(P1,0)*log(l) + (1-equals(P1,0))*pow(l,P1))

t2[l]<-((1-equals(P2,P1))*(equals(P2,0)*log(l) + (1-equals(P2,0))*pow(l,P2)) + equals(P2,P1)*(equals(P2,0)*log(l)*log(l) + (1-equals(P2,0))*pow(l,P2) *log(l)))

log(hazard_ratio[c,j,l])<-d[j,1]-d[c,1]+(d[j,2]-d[c,2])*t1[l]+(d[j,3]-d[c,3])*t2[l]

}}}

}

WInbugs data structure

s[] study identifier

r[] incident cases in interval

n[] at risk at beginning of interval

k[] treatment

b[] comparator

time[] interval number/time point

s[]   r[]   n[]   k[]   b[]   time[]

1   7   81   2   1   1

1   5   74   2   1   2

1   7   69   2   1   3

1   6   62   2   1   4

.   .   .   .   .   .

.   .   .   .   .   .

.   .   .   .   .   .

7   53   134   3   1   3

7   70   160   3   1   4

7   12   90   3   1   5

7   22   78   3   1   6

END

# comparison by study (only used for random effects model)

ks[]   bs[]

2   1

3   2

2   1

2   1

4   1

2   1

3   1

END

Acknowledgements and funding

The research was performed without specific funding.

References

  1. Higgins JPT, Whitehead A: Borrowing strength from external trials in a meta-analysis.

    Statistics in Medicine 1996, 15:2733-2749. PubMed Abstract | Publisher Full Text OpenURL

  2. Lumley T: Network meta-analysis for indirect treatment comparisons.

    Statistics in Medicine 2002, 21:2313-2324. PubMed Abstract | Publisher Full Text OpenURL

  3. Lu G, Ades AE: Combination of direct and indirect evidence in mixed treatment comparisons.

    Statistics in Medicine 2004, 23:3105-3124. PubMed Abstract | Publisher Full Text OpenURL

  4. Caldwell DM, Ades AE, Higgins JPT: Simultaneous comparison of multiple treatments: combining direct and indirect evidence.

    British Medical Journal 2005, 331:897-900. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  5. Royston P, Altman DG: Regression using fractional polynomials of continuous covariates: parsimonious parametric modelling (with discussion).

    Applied Statistics 1994, 43:429-467. Publisher Full Text OpenURL

  6. Lambert PC, Smith LK, Jones DR, Botha JL: Additive and multiplicative covariate regression models for relative survival incorporating fractional polynomials for time-dependent effects.

    Statistics in Medicine 2005, 24:3871-3885. PubMed Abstract | Publisher Full Text OpenURL

  7. Bossard N, Descotes F, Bremond AG, Bobin Y, De Saint Hilaire P, Golfier F, et al.: Keeping data continuous when analyzing the prognostic impact of a tumor marker: an example with cathepsin D in breast cancer.

    Breast Cancer Research and Treatment 2003, 82:47-59. PubMed Abstract | Publisher Full Text OpenURL

  8. Berger U, Schafer J, Ulm K: Dynamic Cox modelling based on fractional polynomials: time-variations in gastric cancer prognosis.

    Statistics in Medicine 2003, 22:1163-1180. PubMed Abstract | Publisher Full Text OpenURL

  9. Bagnardi V, Zambon A, Quatto P, Corrao G: Flexible meta-regression functions for modeling aggregate dose response data, with an application to alcohol and mortality.

    American Journal of Epidemiology 2004, 159:1077-1086. PubMed Abstract | Publisher Full Text OpenURL

  10. Sauerbrei W, Royston P: Building multivariable prognostic and diagnostic models: transformation of the predictors by using fractional polynomials.

    JRSSA 1999, 162:71-94. OpenURL

  11. Sauerbrei W, Royston P, Look M: A new proposal for multivariable modelling of time-varying effects in survival data based on fractional polynomial time-transformation.

    Biom J 2007, 9:453-473. OpenURL

  12. Lu G, Ades AE: Assessing evidence inconsistency in mixed treatment comparisons.

    J Am Stat Assoc 2006, 101:447-459. Publisher Full Text OpenURL

  13. Cooper NJ, Sutton AJ, Morris D, Ades AE, Welton NJ: Addressing between-study heterogeneity and inconsistency in mixed treatment comparisons: Application to stroke prevention treatments in individuals with non-rheumatic atrial fibrillation.

    Stat Med 2009, 28:1861-81. PubMed Abstract | Publisher Full Text OpenURL

  14. Akaike H: Information theory and an extension of the maximum likelihood principle.

    Second International Symposium on Information Theory 1973, 1:267-281. OpenURL

  15. Dempster AP: The direct use of likelihood for significance testing.

    Statistics and Computing 1997, 7:247-252. Publisher Full Text OpenURL

  16. Spiegelhalter DJ, Best NG, Carlin BP, van der Linde A: Bayesian measures of model complexity and fit.

    Journal of the Royal Statistical Society, Series B 2002, 64:583-639. Publisher Full Text OpenURL

  17. Molina JR, Yang P, Cassivi SD, Schild SE, Adjei AA: Non-small cell lung cancer: epidemiology, risk factors, treatment, and survivorship.

    Mayo Clin Proc 2008, 83:584-94. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  18. De Lima Araújo LH, Ferreira CG: Platinum-based second-line treatment in non-small-cell lung cancer: an old new kid on the block?

    J Clin Oncol 2010, 10(28):e24-5. OpenURL

  19. Chang A, Parikh P, Thongprasert S, Tan E, Perng R, Ganzon D, et al.: Gefitinib IRESSA in patients of Asian origin with refractory advanced non-small cell lung cancer: subset analysis from the ISEL study.

    Journal of thoracic oncology: official publication of theInternational Association for the Study of Lung Cancer 2006, 1:847-55. OpenURL

  20. Cufer T, Vrdoljak E, Gaafar R, Erensoy I, Pemberton K, SIGN Study Group: Phase II, open-label, randomized study SIGN of single-agent gefitinib IRESSA or docetaxel as second-line therapy in patients with advanced stage IIIb or IV non-small-cell lung cancer.

    Anti-cancer drugs 2006, 17:401-9. PubMed Abstract | Publisher Full Text OpenURL

  21. Hanna N, Shepherd FA, Fossella FV, Pereira JR, De Marinis F, von Pawel J, et al.: Randomized phase III trial of pemetrexed versus docetaxel in patients with non-small-cell lung cancer previously treated with chemotherapy.

    J Clin Oncol 2004, 22:1589-97. PubMed Abstract | Publisher Full Text OpenURL

  22. Kim E, Hirsh V, Mok T, Socinski M, Gervais R, Wu Y, et al.: Gefitinib versus docetaxel in previously treated non-small-cell lung cancer INTEREST: a randomised phase III trial.

    Lancet 2008, 372:1809-18. PubMed Abstract | Publisher Full Text OpenURL

  23. Lee D, Park K, Kim J, Lee J, Shin S, Kang J, et al.: Randomized Phase III trial of gefitinib versus docetaxel in non-small cell lung cancer patients who have previously received platinum-based chemotherapy.

    Clinical cancer research 2010, 16:1307-14. PubMed Abstract | Publisher Full Text OpenURL

  24. Maruyama R, Nishiwaki Y, Tamura T, Yamamoto N, Tsuboi M, Nakagawa K, et al.: Phase III study, V-15-32, of gefitinib versus docetaxel in previously treated Japanese patients with non-small-cell lung cancer.

    Journal of clinical oncology: official journal of the AmericanSociety of Clinical Oncology 2008, 26:4244-52. OpenURL

  25. Shepherd FA, Dancey J, Ramlau R, et al.: Prospective randomized trial of docetaxel versus best supportive care in patients with non-small-cell lung cancer previously treated with platinum-based chemotherapy.

    J Clin Oncol 2000, 18:2095-103. PubMed Abstract | Publisher Full Text OpenURL

  26. Ades AE, Sculpher M, Sutton AJ, Abrams K, Cooper N, Welton N, Lu G: Bayesian Methods for Evidence Synthesis in Cost-Effectiveness Analysis.

    Pharmacoeconomics 2006, 24:1-19. PubMed Abstract | Publisher Full Text OpenURL

  27. Spiegelhalter DJ, Abrams KR, Myles JP: Bayesian approaches to clinical trials and health-care evaluations. Chichester: John Wiley & Sons; 2004:80-85.

  28. Spiegelhalter DJ, Abrams KR, Myles JP: Bayesian approaches to clinical trials and health-care evaluations. Chichester: John Wiley & Sons; 2004:286.

  29. Spiegelhalter D, Thomas A, Best N, Lunn D: WinBUGS User Manual: Version 1.4. MRC Biostatistics Unit: Cambridge; 2003. OpenURL

Pre-publication history

The pre-publication history for this paper can be accessed here:

http://www.biomedcentral.com/1471-2288/11/61/prepub