Highlights
- •Network meta-analysis (NMA) of time-to-event outcomes based on constant hazard ratios can result in biased findings when the proportional hazards (PHs) assumption does not hold in a subset of the trials. Several NMA methods that do not rely on the PH assumption have been proposed. Nevertheless, their application to health technology assessment submissions has been limited, possibly due to a lack of familiarity among researchers performing cost-effectiveness analysis of oncology drugs.
- •NMA methods that allow for non-PH may provide relative treatment effect estimates that better reflect the evidence. We provide a systematic overview and comparison of methods using a case study to increase familiarity. Given the impact of NMA model choice on findings, we propose a stepwise process to select appropriate NMA models. This is especially important when considering the most flexible NMA models.
- •When relative treatment effects in terms of time-to-event outcomes need to be estimated based on an NMA of randomized controlled trials and the PH assumption is uncertain in one or more of the randomized controlled trials, alternative parametric NMA methods that do not rely on this assumption are recommended.
Abstract
Objectives
Methods
Results
Conclusions
Keywords
Introduction
NMA Methods That Do Not Rely on the PH Assumption
- Connock M.
- Armoiry X.
- Tsertsvadze A.
- et al.
Methods | Description of NMA model | Survival distribution/function | One- or 2-step | Framework | Likelihood | Relative treatment effect and how is non-PH addressed | Between-study heterogeneity | Inconsistency models | Meta-regression |
---|---|---|---|---|---|---|---|---|---|
Ouwens et al 6 | Uses a multivariate relative treatment effect as an alternative to the synthesis of the trial-specific constant HRs. The hazard functions of the interventions in a trial are modeled using parametric distribution and the difference in the parameters are considered the multidimensional relative treatment effect, which are synthesized (and indirectly compared) across studies | Weibull, Gompertz, log-normal, log-logistic | One-step (trial level–specific relative treatment effects and pooled effects are estimated simultaneously) | Bayesian | Approximation with piecewise constant hazards (discrete hazards) according to a binomial likelihood | Multivariate relative treatment effect parameters regarding scale and shape related factors of the survival distribution/function. These relative treatment effect parameters are used to describe time-varying HRs (or odds ratios in case of log-logistic models) | Yes | Yes | No |
Jansen 7 | First (Weibull, Gompertz) and second-order fractional polynomials describing the ln-hazards over time | No | |||||||
Jansen and Cope 12 | Yes | ||||||||
Vickers et al 13 | Yes - extended to exchangeable treatment-by-covariate-interaction structures † “Allow relative treatment effects to vary by covariates independently of the other treatments in the network of evidence. The relative treatment effect remains constant for any treatment not specified within a hierarchical exchangeable structure… In addition, where possible, different doses also were included as a hierarchical structure with an overall treatment class effect. Constraints were imposed to ensure that the efficacy increased with dose intensity.” Adapted from Owen et al.17 | ||||||||
Cope et al 8 | For each arm of every RCT in the network, (recreated), IPD are used to estimate alternative survival distributions. Next, for each distribution, its scale and shape parameters are included in a multivariate NMA to obtain time-varying estimates of relative treatment effects between competing interventions | Weibull, Gompertz, log-normal, log-logistic describing the ln-hazards over time | Two-step (arm-specific survival function parameters are estimated first. Subsequently, these are incorporated in the multivariate NMA) | Step 1 – Frequentist; Step 2 – Bayesian | Exact likelihood corresponding to survival distribution selected | No | |||
Freeman and Carpenter 9 | An IPD Royston-Parmar Bayesian NMA model, which provides flexible alternative modeling approach that can accommodate time-dependent effects. The baseline log-cumulative hazards are modeled with restricted cubic splines. HRs are either fixed over time or can be modeled as a function of ln(time) | Restricted cubic splines describing the baseline log-cumulative hazard of each trial | Two-step (described as 1-step but requires orthogonalized basis function of study-specific splines as input for NMA) | Bayesian framework for NMA but first step in frequentist framework | General likelihood using “zeros trick” using probability density function of Poisson distribution ‡ If we wish to implement a likelihood representing the flexible fractional polynomials or cubic splines in WinBUGS, we can use the “zeros” trick17 where a data set comprising entirely of zeros is given a Poisson distribution with its parameter defined equal to the negative log-likelihood (plus a sufficiently large constant); the log-likelihood function corresponding to the fractional polynomial or spline is then written algebraically in the WinBUGS code. | Constant HRs represented with a single basic parameter by treatment. As an extension, HR can vary over time by adding extra parameters for the interaction between treatment and ln(time) | Yes | Yes | No |
Petit et al 14 | A 2-step analysis to estimate RMST based on (reconstructed) IPD from KMs; then evaluated mean difference in RMST in NMA model | RMS estimated based on (1) trial-specific KM method; (2) AUC of KM + exponential tail | Two-step | Frequentist | Normal likelihood for NMA model and exact likelihood corresponding to parametric distribution selected (if extrapolation involved) | Difference in RMS (AUC up to specific time point) between treatments | Yes | Yes | No |
Connock et al 15
Comparative survival benefit of currently licensed second or third line treatments for epidermal growth factor receptor (EGFR) and anaplastic lymphoma kinase (ALK) negative advanced or metastatic non-small cell lung cancer: a systematic review and secondary analysis of trials. BMC Cancer. 2019; 19: 392 | RMS estimated based on (1) AUC Weibull/gen gamma per arm; (2) mean survival using Weibull; (3) AUC of KM + exponential tail | Bayesian | No | No | No | ||||
Niglio et al 16 | RMS estimated based on (1) pseudo-values based on KM; (2) Poisson-gamma frailty model | Frequentist | No | No | No |
One-Step Multivariate NMA Model
Two-Step Multivariate NMA Model
NMA Model With Cubic Splines for Baseline Hazard
Restricted Mean Survival NMA
- Connock M.
- Armoiry X.
- Tsertsvadze A.
- et al.
Illustrative Example
Methods
Evidence base

Analysis
NMA based on constant HRs
One-step multivariate NMA
Two-step multivariate NMA
NMAs with RCS for baseline hazard
Restricted mean survival time NMA
Model parameter estimation
Model selection by NMA method
- Connock M.
- Armoiry X.
- Tsertsvadze A.
- et al.
- Latimer N.
Model selection process | One-step multivariate NMA 6 ,7 | Two-step multivariate NMA methods 8 ,20 | NMA cubic splines for baseline hazard 9 | RMST NMA 14 , 15 ,
Comparative survival benefit of currently licensed second or third line treatments for epidermal growth factor receptor (EGFR) and anaplastic lymphoma kinase (ALK) negative advanced or metastatic non-small cell lung cancer: a systematic review and secondary analysis of trials. BMC Cancer. 2019; 19: 392 16 | ||
---|---|---|---|---|---|---|
Competing models considered | Traditional parametric models: Log-logistic, Log-normal, Weibull, and Gompertz Treatment effect:
| FP models: P1 = 0 or 1; P2 = −1, −0.5, 0, 0.5, or 1 Treatment effect:
| Traditional parametric models: Log-logistic, Log-normal, Weibull, Gompertz Treatment effect:
| FP models: P1 = 0 or 1; P2 = −1, −0.5, 0, 0.5, or 1 Treatment effect:
| Royston-Parmar RCS models: 1, 2, or 3 internal knots All studies with 1 knot Study-specific number of knots Treatment effect:
| (A) RMST up to 2 years based on KM curves (B) RMST up to 5 years using the best-fitting model per arm (RCS with 1, 2, or 3 internal knots) |
Trial-level diagnostics to assess the development of the hazard over time | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() |
(1) PH evaluations and diagnostics (ie, time-dependent covariate test using Cox model, plot scaled Schoenfeld residuals over time 48 ,49 and Grambsch and Therneau test50 ); (2) Quantile-quantile plot of times of survival percentiles; (3) Log-cumulative hazard plots vs log(time); and (4) Plot of smoothed empirical hazard vs time | ||||||
Select arm or trial-level model | ![]() Not applicable since study-level estimates and indirect comparisons performed simultaneously | ![]() Not applicable since study-level estimates and indirect comparisons performed simultaneously | ![]() Arm-level model: AIC per arm and across full network and visual inspection per arm | ![]() Arm-level model: AIC per arm and across full network and visual inspection per arm | ![]() Trial-level models: AIC per trial and visual inspection per trial | ![]() Arm-level models when extrapolation is required for (B): AIC per arm and visual inspection per arm |
NMA models – goodness of fit per method | ![]() DICs can be compared across parametric models since uses same data and likelihood to inform model † Given the star-shaped network with only 1 trial per contrast, we were able to use the relative treatment effects as estimated from the NMA model to predict trial-specific hazards and survival (by applying them to trial-specific baseline), which were visually inspected and compared with the observed hazards and survival. Nevertheless, this step is not proposed by authors in the original papers and would not be feasible if there were > 1 study informing each comparison or if there was indirect evidence. | ![]() DICs across FP models can be compared since uses same data and same likelihood to inform model and parameterization of relative treatment effects (PH vs non-PH) † Given the star-shaped network with only 1 trial per contrast, we were able to use the relative treatment effects as estimated from the NMA model to predict trial-specific hazards and survival (by applying them to trial-specific baseline), which were visually inspected and compared with the observed hazards and survival. Nevertheless, this step is not proposed by authors in the original papers and would not be feasible if there were > 1 study informing each comparison or if there was indirect evidence. | ![]() DICs across parametric models cannot be compared since data (ie, parameters of different distributions) is not the same across models (likelihood is same) | ![]() DICs across FP models cannot be compared since data (ie, parameters of different distributions) is not the same across models (likelihood is same) | ![]() DICs across spline models can be compared since the model uses same data and likelihood to inform model and parameterization of relative treatment effects (ie, PH vs non-PH) | ![]() Only 1 RMST model evaluated for extrapolated survival (B); If alternative extrapolations were used to inform RMST, DICs across RMST models could not be compared (data would differ) |
Comparison of NMA methods regarding treatment effects
Results
Model selection
Comparison of NMA methods regarding relative treatment effects


Discussion
Findings From the Illustrative Example
Limitations of the Illustrative Example
Motzer RJ, Tannir NM, McDermott DF, et al. Conditional survival and 5-year follow-up in CheckMate 214: First-line nivolumab plus ipilimumab (N+I) versus sunitinib (S) in advanced renal cell carcinoma (aRCC). 661P. Presented at: European Society for Medical Oncology, September 16-21, 2021; Virtual Congress.
Comparison With the Literature
- Freeman S.C.
- Cooper N.J.
- Sutton A.J.
- Crowther M.J.
- Carpenter J.R.
- Hawkins N.
- Freeman S.C.
- Cooper N.J.
- Sutton A.J.
- Crowther M.J.
- Carpenter J.R.
- Hawkins N.
Proposed Model Selection Process for Non-PH NMA Methods

- Latimer N.
Conclusion
Article and Author Information
Acknowledgment
Supplemental Material
References
- CONSORT 2010 explanation and elaboration: updated guidelines for reporting parallel group randomised trials.J Clin Epidemiol. 2010; 63: e1-e37
- Review of the reporting of survival analyses within randomised controlled trials and the implications for meta-analysis.PLoS One. 2016; 11e0154870
- Restricted mean survival time: an alternative to the hazard ratio for the design and analysis of randomized trials with a time-to-event outcome.BMC Med Res Methodol. 2013; 13: 152
- Pooled analysis of long-term survival data from phase II and phase III trials of ipilimumab in unresectable or metastatic melanoma.J Clin Oncol. 2015; 33: 1889-1894
- Statistical inference methods for two crossing survival curves: a comparison of methods.PLoS One. 2015; 10e0116774
- Network meta-analysis of parametric survival curves.Res Synth Methods. 2010; 1: 258-271
- Network meta-analysis of survival data with fractional polynomials.BMC Med Res Methodol. 2011; 11: 61
- Multivariate network meta-analysis of survival function parameters.Res Synth Methods. 2020; 11: 443-456
- Bayesian one-step IPD network meta-analysis of time-to-event data using Royston-Parmar models.Res Synth Methods. 2017; 8: 451-464
- Uptake of methodological advances for synthesis of continuous and time-to-event outcomes would maximize use of the evidence base.J Clin Epidemiol. 2020; 124: 94-105
- PNS92 a targeted review of indirect treatment comparison synthesis methods that do not rely on the proportional hazard assumption.Value Health. 2021; 24S189
- Meta-regression models to address heterogeneity and inconsistency in network meta-analysis of survival outcomes.BMC Med Res Methodol. 2012; 12: 152
- Relative efficacy of interventions in the treatment of second-line non-small cell lung cancer: A systematic review and network meta-analysis.BMC Cancer. 2019; 19: 353
- Individual patient data network meta-analysis using either restricted mean survival time difference or hazard ratios: is there a difference? A case study on locoregionally advanced nasopharyngeal carcinomas.Syst Rev. 2019; 8: 96
- Comparative survival benefit of currently licensed second or third line treatments for epidermal growth factor receptor (EGFR) and anaplastic lymphoma kinase (ALK) negative advanced or metastatic non-small cell lung cancer: a systematic review and secondary analysis of trials.BMC Cancer. 2019; 19: 392
- Programmed death-1 or programmed death ligand-1 blockade in patients with platinum-resistant metastatic urothelial cancer: a systematic review and meta-analysis.Eur Urol. 2019; 76: 782-789
- The BUGS Book.Boca Raton, FL: CRC Press, 2013
- Network meta-analysis: development of a three-level hierarchical modeling approach incorporating dose-related constraints.Value Health. 2015; 18: 116-126
- Enhanced secondary analysis of survival data: reconstructing the data from published Kaplan-Meier survival curves.BMC Med Res Methodol. 2012; 12: 9
- PCN283 extending multivariate network meta-analysis of survival function parameters to fractional polynomials.Value Health. 2020; 23: S472
- Flexible parameteric proportional-hazards and proportional-odds models for censored survival data, with application to prognostic modelling and estimation of treatment effects.Stat Med. 2002; 21: 2175-2197
- Meta-analysis of time-to-event outcomes from randomized trials using restricted mean survival time: application to individual participant data.Stat Med. 2015; 34: 2881-2898
- Bias and precision of measures of survival gain from right-censored data.Pharm Stat. 2015; 14: 409-417
- Bias and precision of methods for estimating the difference in restricted mean survival time from an individual patient data meta-analysis.BMC Med Res Methodol. 2016; 16: 37
- Cabozantinib versus everolimus, nivolumab, axitinib, sorafenib and best supportive care: a network meta-analysis of progression-free survival and overall survival in second line treatment of advanced renal cell carcinoma.PLoS One. 2017; 12e0184423
- Axitinib, cabozantinib, everolimus, nivolumab, sunitinib and best supportive care in previously treated renal cell carcinoma: a systematic review and economic evaluation.Health Technol Assess. 2018; 22: 1
- Network meta-analysis of second-line treatment in metastatic renal cell carcinoma: efficacy and safety.Future Oncol. 2017; 13: 2709-2717
- Comparative efficacy of first-line immune-based combination therapies in metastatic renal cell carcinoma: a systematic review and network meta-analysis.Cancers. 2020; 12: 1673
- Treatment of metastatic renal cell carcinoma in older patients: a network meta-analysis.J Geriatr Oncol. 2019; 10: 149-154
- Targeted therapies for previously treated advanced or metastatic renal cell carcinoma: systematic review and network meta-analysis.BMJ Open. 2019; 9e024691
- Nivolumab previously treated advanced renal cell carcinoma. National Institute for Health and Care Excellence (NICE).https://www.nice.org.uk/guidance/ta417Date accessed: February 4, 2021
- Lenvatinib everolimus previously treated advanced renal cell carcinoma. National Institute for Health and Care Excellence (NICE).https://www.nice.org.uk/guidance/ta498Date accessed: February 4, 2021
- Nivolumab with ipilimumab untreated advanced renal cell carcinoma. National Institute for Health and Care Excellence (NICE).https://www.nice.org.uk/guidance/ta581Date accessed: February 4, 2021
- Second-line cabozantinib versus nivolumab in advanced renal cell carcinoma: systematic review and indirect treatment comparison.Crit Rev Oncol Hematol. 2019; 139: 143-148
- Indirect treatment comparisons including network meta-analysis: lenvatinib plus everolimus for the second-line treatment of advanced/metastatic renal cell carcinoma.PLoS One. 2019; 14e0212899
- Nivolumab versus cabozantinib: comparing overall survival in metastatic renal cell carcinoma.PLoS One. 2016; 11e0155389
- Survival outcomes and independent response assessment with nivolumab plus ipilimumab versus sunitinib in patients with advanced renal cell carcinoma: 42-month follow-up of a randomized phase 3 clinical trial.J Immunother Cancer. 2020; 8e000891
- Nivolumab plus ipilimumab versus sunitinib in advanced renal-cell carcinoma.N Engl J Med. 2018; 378: 1277-1290
- Pazopanib versus sunitinib in metastatic renal-cell carcinoma.N Engl J Med. 2013; 369: 722-731
- Overall survival in renal-cell carcinoma with pazopanib versus sunitinib.N Engl J Med. 2014; 370: 1769-1770
- Updated efficacy results from the JAVELIN Renal 101 trial: first-line avelumab plus axitinib versus sunitinib in patients with advanced renal cell carcinoma.Ann Oncol. 2020; 31: 1030-1039
- Avelumab plus axitinib versus sunitinib for advanced renal-cell carcinoma.N Engl J Med. 2019; 380: 1103-1115
- Pembrolizumab plus axitinib versus sunitinib as first-line therapy for advanced renal cell carcinoma (RCC): updated analysis of KEYNOTE-426.J Clin Oncol. 2020; 38 (5001-5001)
- Pembrolizumab plus axitinib versus sunitinib for advanced renal-cell carcinoma.N Engl J Med. 2019; 380: 1116-1127
- Pembrolizumab plus axitinib versus sunitinib monotherapy as first-line treatment of advanced renal cell carcinoma (KEYNOTE-426): extended follow-up from a randomised, open-label, phase 3 trial.Lancet Oncol. 2020; 21: 1563-1573
- Evidence synthesis for decision making 4: inconsistency in networks of evidence based on randomized controlled trials.Med Decis Making. 2013; 33: 641-656
- Regression models and life-tables.J R Stat Soc B. 1972; 34: 187-202
- Applied Survival Analysis.2nd ed. John Wiley & Sons, Inc, Hoboken, NJ2008
- Partial residuals for the proportional hazards regression model.Biometrika. 1982; 69: 239-241
- Proportional hazards tests and diagnostics based on weighted residuals.Biometrika. 1994; 81: 515-526
- NICE DSU technical support document 14: undertaking survival analysis for economic evaluations alongside clinical trials - extrapolation with patient-level data. National Institute for Health and Care Excellence (NICE).http://www.nicedsu.org.uk/NICE%20DSU%20TSD%20Survival%20analysis.updated%20March%202013.v2.pdfDate accessed: October 1, 2016
- Bayesian measures of model complexity and fit.J R Stat Soc B. 2002; 64: 583-639
- National life tables: UK. Office for National Statistics.
- Pembrolizumab with axitinib for untreated advanced renal cell carcinoma. National Institute for Health and Care Excellence (NICE).https://www.nice.org.uk/guidance/ta650Date accessed: February 17, 2022
- Avelumab with axitinib for untreated advanced renal cell carcinoma. National Institute for Health and Care Excellence (NICE).https://www.nice.org.uk/guidance/ta645Date accessed: February 17, 2022
- Quantitative summaries of treatment effect estimates obtained with network meta-analysis of survival curves to inform decision-making.BMC Med Res Methodol. 2013; 13: 147
- Graphical methods and numerical summaries for presenting results from multiple-treatment meta-analysis: an overview and tutorial.J Clin Epidemiol. 2011; 64: 163-171
- Nonlocal behavior in polynomial regressions.Am Stat. 1998; 52: 20-22
- Generalized linear models for flexible parametric modeling of the hazard function.Med Decis Making. 2019; 39: 867-878
- NICE DSU technical support document 21: flexible methods for survival analysis. National Institute for Health and Care Excellence (NICE).(Accessed February 17, 2022)
Motzer RJ, Tannir NM, McDermott DF, et al. Conditional survival and 5-year follow-up in CheckMate 214: First-line nivolumab plus ipilimumab (N+I) versus sunitinib (S) in advanced renal cell carcinoma (aRCC). 661P. Presented at: European Society for Medical Oncology, September 16-21, 2021; Virtual Congress.
- What did time tell us? A comparison and retrospective validation of different survival extrapolation methods for immuno-oncologic therapy in advanced or metastatic renal cell carcinoma.Pharmacoeconomics. 2021; 39: 345-356
- Challenges of modelling approaches for network meta-analysis of time-to-event outcomes in the presence of non-proportional hazards to aid decision making: application to a melanoma network.Stat Methods Med Res. 2022; 31: 839-861
- NICE DSU technical support document 6: embedding evidence synthesis in probabilistic cost-effectiveness analysis: software choices. National Institute for Clinical Excellence (NICE).(Accessed February 17, 2022)
- Survival analysis for economic evaluations alongside clinical trials--extrapolation with patient-level data: inconsistencies, limitations, and a practical guide.Med Decis Making. 2013; 33: 743-754
- Guide to the methods of technology appraisal. National Institute for Clinical Excellence (NICE).(Accessed February 17, 2022)
- NICE DSU technical support document 5: evidence synthesis in the baseline natural history model. National Institute for Clinical Excellence (NICE).(Accessed February 17, 2022)
- NICE DSU technical support document 19: partitioned survival analysis for decision modelling in health care: a critical review. National Institute for Clinical Excellence (NICE).(Accessed February 17, 2022)
- Nonproportional hazards in network meta-analysis: efficient strategies for model building and analysis.Value Health. 2020; 23: 918-927
- Regression Modeling Strategies: With Applications to Linear Models, Logistic and Ordinal Regression, and Survival Analysis.Springer, Berlin, Germany2015
- First-line sunitinib versus pazopanib in metastatic renal cell carcinoma: results from the International Metastatic Renal Cell Carcinoma Database Consortium.Eur J Cancer. 2016; 65: 102-108
- How uncertain is the survival extrapolation? A study of the impact of different parametric survival models on extrapolated uncertainty about hazard functions, lifetime mean survival and cost effectiveness.Pharmacoeconomics. 2020; 38: 193-204
Article info
Publication history
Publication stage
In Press Corrected ProofIdentification
Copyright
User license
Creative Commons Attribution – NonCommercial – NoDerivs (CC BY-NC-ND 4.0) |
Permitted
For non-commercial purposes:
- Read, print & download
- Redistribute or republish the final article
- Text & data mine
- Translate the article (private use only, not for distribution)
- Reuse portions or extracts from the article in other works
Not Permitted
- Sell or re-use for commercial purposes
- Distribute translations or adaptations of the article
Elsevier's open access license policy