Highlights
- •Despite the vast amount of real-life data accumulated in healthcare, EQ-5D index scores are frequently lacking for health economic analyses and have to be estimated.
- •We predicted 3 level version of EQ-5D (EQ-5D-3L) index scores in a large heterogenous data set of population surveys and clinical studies using eXtreme Gradient Boosting classification, eXtreme Gradient Boosting regression, and ordinary least squares regression. Regression methods outperformed classification in terms of prediction accuracy and bias. The performance of the 3 methods depended on the applied evaluation criteria, the target population, the included predictors, and the EQ-5D-3L index score range.
- •The prediction accuracy of individual EQ-5D-3L index scores was inadequate for the majority of respondents. For the evaluation of personalized health interventions, we encourage the systematic collection of patient-reported outcomes such as EQ-5D with the involvement of artificial intelligence experts and outcomes researchers to enhance the value of accumulating data in health systems.
Abstract
Objectives
Methods
Results
Conclusions
Keywords
Introduction
National Centre for Healthcare Services.
Methods
EQ-5D-3L
Study Population
Database and Variables
Demographic and disease-related variables
PRO
Data Analysis
Missing data
Prediction models
Evaluation of Model Performance
where is the mean and SD is the SD of the evaluation metric (eg, MAE) of the k cross-validation sets.
Results
Sample Characteristics
Predictor group | Variable | Category | Sample | ||
---|---|---|---|---|---|
General population (“Pop”) | Patients (“Pts”) | Entire sample (“Total”) | |||
“Base” | Age | Mean | 41.1 | 55.6 | 45.4 |
SD | 15.5 | 16.6 | 17.2 | ||
Missing (%) | 0.0 | 4.2 | 1.3 | ||
Gender | Male (%) | 54.6 | 50.4 | 53.3 | |
Female (%) | 45.4 | 49.6 | 46.7 | ||
Missing (%) | 0.0 | 2.8 | 0.9 | ||
Education | Primary (%) | 10.8 | 23.3 | 14.2 | |
Secondary (%) | 39.7 | 57.6 | 44.6 | ||
Tertiary (%) | 49.5 | 19.1 | 41.2 | ||
Missing (%) | 0.0 | 12.9 | 3.9 | ||
Place of residence | Capital (%) | 18.1 | 6.2 | 13.5 | |
City (%) | 51.5 | 45.2 | 49.0 | ||
Village (%) | 30.4 | 48.6 | 37.5 | ||
Missing (%) | 56.8 | 36.6 | 50.7 | ||
Family status | Single (%) | 36.2 | 35.0 | 36.1 | |
Married (%) | 63.8 | 65.0 | 63.9 | ||
Missing (%) | 26.6 | 77.0 | 41.9 | ||
Employment | Paid employment (%) | 68.6 | 42.9 | 61.6 | |
Student (%) | 9.2 | 12.8 | 10.2 | ||
Pensioner (%) | 15.0 | 33.7 | 20.1 | ||
Not working (%) | 5.0 | 6.7 | 5.5 | ||
Other employment (%) | 2.2 | 3.9 | 2.6 | ||
Missing (%) | 0.0 | 13.4 | 4.1 | ||
Relative income (0-11.0) | Mean | 1.6 | 0.5 | 1.6 | |
SD | 1.5 | 0.3 | 1.5 | ||
Missing (%) | 32.0 | 97.2 | 51.8 | ||
Setting | General population (%) | 100.0 | - | 69.7 | |
Outpatient (%) | - | 30.1 | 9.1 | ||
Hospitalized (%) | - | 39.2 | 11.9 | ||
Postoperative (%) | - | 30.7 | 9.3 | ||
Number of GP visits at 12 months | Mean | - | 4.0 | 4.0 | |
SD | - | 6.1 | 6.1 | ||
Missing (%) | 100.0 | 78.7 | 93.5 | ||
Any GP visit past year | No (%) | - | 49.1 | 49.1 | |
Yes (%) | - | 50.9 | 50.9 | ||
Missing (%) | 100.0 | 78.7 | 93.5 | ||
Specialist visits past year | Mean | - | 5.8 | 5.8 | |
SD | - | 7.5 | 7.5 | ||
Missing (%) | 100.0 | 80.9 | 94.2 | ||
Any specialist visits past year | No (%) | - | 17.1 | 17.1 | |
Yes (%) | - | 82.9 | 82.9 | ||
Missing (%) | 100.0 | 80.9 | 94.2 | ||
Hospitalizations past year | Mean | 0.2 | 1.7 | 0.7 | |
SD | 0.6 | 3.8 | 2.2 | ||
Missing (%) | 90.0 | 86.5 | 89.0 | ||
Any hospitalization at 12 months | No (%) | 90.4 | 41.9 | 72.5 | |
Yes (%) | 9.6 | 58.1 | 27.5 | ||
Missing (%) | 90.0 | 86.5 | 89.0 | ||
Informal care recipient | No (%) | 92.2 | 70.9 | 82.4 | |
Yes (%) | 7.8 | 29.1 | 17.6 | ||
Missing (%) | 90.0 | 80.3 | 87.1 | ||
Weight, kg | Mean | 76.1 | 75.2 | 75.6 | |
SD | 16.1 | 16.8 | 16.4 | ||
Missing (%) | 88.9 | 72.8 | 84.0 | ||
Height, cm | Mean | 171.5 | 167.6 | 169.7 | |
SD | 9.4 | 9.7 | 9.8 | ||
Missing (%) | 88.8 | 77.7 | 85.5 | ||
BMI | Mean | 25.8 | 26.8 | 26.3 | |
SD | 4.8 | 5.2 | 5.0 | ||
Missing (%) | 88.9 | 77.8 | 85.5 | ||
DRO score | Mean | - | 0.7 | 0.7 | |
SD | - | 0.2 | 0.2 | ||
Missing (%) | 100.0 | 91.1 | 97.3 | ||
Chronic morbidity | No (%) | 68.5 | 0.0 | 68.5 | |
Yes (%) | 31.5 | 0.0 | 31.5 | ||
Missing (%) | 90.0 | 100.0 | 93.0 | ||
Any disease | No (%) | 70.5 | 0.0 | 29.5 | |
Yes (%) | 29.5 | 100.0 | 70.5 | ||
Missing (%) | 68.8 | 0.0 | 47.9 | ||
Specific diagnoses | Not included in the table | ||||
“PRO” | Happiness | Mean | 7.6 | - | 7.6 |
SD | 2.0 | - | 2.0 | ||
Missing (%) | 90.0 | 100.0 | 93.0 | ||
Self-rated health | Very good (%) | 20.7 | 0.0 | 20.7 | |
Good (%) | 45.3 | 0.0 | 45.3 | ||
Fair (%) | 26.9 | 0.0 | 26.9 | ||
Bad (%) | 6.2 | 0.0 | 6.2 | ||
Very Bad (%) | 0.9 | 0.0 | 0.9 | ||
Missing (%) | 90.0 | 100.0 | 93.0 | ||
GALI | Severely limited (%) | 3.3 | 0.0 | 3.3 | |
Limited, but not severely (%) | 16.8 | 0.0 | 16.8 | ||
Not limited (%) | 79.9 | 0.0 | 79.9 | ||
Missing (%) | 90.0 | 100.0 | 93.0 | ||
PRO score | Mean | - | 0.7 | 0.7 | |
SD | - | 0.3 | 0.3 | ||
Missing (%) | 100.0 | 80.7 | 94.1 | ||
EQ-VAS (0-100) | Mean | 77.0 | 65.4 | 73.3 | |
SD | 18.9 | 22.3 | 20.7 | ||
Missing (%) | 11.5 | 6.3 | 9.9 |
The Distribution of True and Predicted EQ-5D-3L Index Scores
Accuracy of Predictions



Patterns of Prediction Error
Accuracy of Predictions by True EQ-5D-3L Index Scores

Accuracy of Predictions by Predicted EQ-5D-3L Index Scores

Discussion
- Borchani H.
- Bielza C.
- Marti Nez-Marti N.P.
- Larranaga P.
- Borchani H.
- Bielza C.
- Marti Nez-Marti N.P.
- Larranaga P.
Conclusions
Article and Author Information
Supplemental Materials
- Supplementary Material 1
References
- Recommendations to OECD Ministers of Health from the high level reflection group on the future of health statistics: strengthening the international comparison of health system performance through patient-reported indicators. OECD.
- Endpoints used for relative effectiveness assessment of pharmaceuticals. Health-related quality of life and utility measures. EUnetHTA.www.eunethta.euDate accessed: June 30, 2021
- Which multi-attribute utility instruments are recommended for use in cost-utility analysis? A review of national health technology assessment (HTA) guidelines.Eur J Health Econ. 2020; 21: 1245-1257
- EQ-5D is a recommended tool for use in cost-utility analyses around the globe. EuroQoL Group.https://euroqol.org/eq-5d-is-a-recommended-tool-for-use-in-cost-utility-analyses-around-the-globe/Date accessed: January 29, 2020
- EuroQol — a new facility for the measurement of health-related quality of life.Health Policy. 1990; 16: 199-208
- National Centre for Healthcare Services.https://www.eeszt.gov.hu/hu/nyito-oldalDate accessed: June 30, 2021
- Slovenia moving HIS applications to central eHealth platform (eZdravje). European Commission. Joinup Web Site.
- Implementation of electronic health records in Polish outpatient health care clinics — starting point, progress, problems, and forecasts.Ann Agric Environ Med. 2016; 23: 329-334
- Impact of big data analytics on people’s health: overview of systematic reviews and recommendations for future studies.J Med Internet Res. 2021; 23e27275
- Digitalisation and Big Data: Implications for the Health Sector.European Parliament, Brussels, Belgium2018
- The potential for artificial intelligence in healthcare.Future Healthc J. 2019; 6: 94-98
- Revolutionizing healthcare with IoT and cognitive, cloud-based telemedicine.Acta Polytech Hung. 2019; 16: 163-181
- Toward precision health: applying artificial intelligence analytics to digital health biometric datasets.Per Med. 2020; 17: 307-316
- Towards a European health data ecosystem.Eur J Risk Regul. 2020; 11: 884-893
- EQ-5D and the EuroQol Group: past, present and future.Appl Health Econ Health Policy. 2017; 15: 127-137
- Collection and use of EQ-5D for follow-up, decision-making, and quality improvement in health care — the case of the Swedish National Quality Registries.J Patient Rep Outcomes. 2020; 4: 78
- EQ-5D in Central and Eastern Europe: 2000-2015.Qual Life Res. 2016; 25: 2693-2710
- Results of the 2014 Health Interview Survey - summary data.KSH, Budapest, Hungary2018
- Mapping to obtain EQ-5D utility values for use in NICE health technology assessments.Value Health. 2013; 16: 202-210
- Estimating EuroQol EQ-5D scores from Population Healthy Days data.Med Decis Mak. 2008; 28: 491-499
- Estimating EQ-5D utilities based on the Short-Form Long Term Conditions Questionnaire (LTCQ-8).Health Qual Life Outcomes. 2020; 18: 279
- Disease burden of psoriatic arthritis compared to rheumatoid arthritis, Hungarian experiment.Rheumatol Int. 2009; 30: 199-205
- Assessment of health-related quality of life in psoriasis patients in Hungary.Orv Hetil. 2018; 159: 837-846
- [Quality of life and burden of disease in peripheral arterial disease: a study among Hungarian patients].Orv Hetil. 2013; 154: 464-470
- Costs of rheumatoid arthritis in Hungary.J Rheumatol. 2007; 34: 1437
- Cost-of-illness of patients with systemic sclerosis in a tertiary care centre.Rheumatol (Oxf Engl). 2010; 49: 1920-1928
- Costs of dementia in Hungary.J Nutr Health Aging. 2010; 14: 633-639
- Health-related costs of diabetes mellitus in adults treated with insulin. Cross-sectional survey of 480 patients in general practice and outpatient settings.Diabetologia Hungarica. 2010; 19: 37-44
- The burden of endometriosis: costs and quality of life of women with endometriosis and treated in referral centres.Hum Reprod. 2012; 27: 1292-1299
- Ten-year fracture risk by FRAX((R)) of women with osteoporosis attending osteoporosis care in Hungary.Orv Hetil. 2016; 157: 146-153
- Exploring the relationship between quality of life (EQ-5D) and clinical measures in adult attention deficit hyperactivity disorder (ADHD).Appl Res Qual Life. 2016; 12: 409-424
- Health related quality of life in patients with bladder cancer: a cross-sectional survey and validation study of the Hungarian version of the Bladder Cancer Index.Pathol Oncol Res. 2015; 21: 619-627
- Cost of illness of medically treated benign prostatic hyperplasia in Hungary.Int Urol Nephrol. 2015; 47: 1241-1249
- Survey of epilepsy in adults in Hungary: quality of life and costs.Ideggyógyászati Sz. 2013; 66: 262
- Health related quality of life and productivity of women with overactive bladder.Orv Hetil. 2012; 153: 1068-1076
- Quality of life and costs in Parkinson’s disease: a cross sectional study in Hungary.PLoS One. 2014; 9e107704
- Quality of life and disease burden of patients with schizophrenia in Hungary.Psychiatr Hung. 2012; 27: 4-17
- Health status and costs of ambulatory patients with multiple sclerosis in Hungary.Ideggyogy Sz. 2012; 65: 316-324
- Pns401 the first parallel Eq-5d-3l and Eq-5d-5l composite time trade-off valuation study in Europe.Value Health. 2019; 22
- Health status of the Hungarian population between 2000-2010.Orv Hetil. 2015; 156: 2035-2044
- Subjective health expectations of patients with age-related macular degeneration treated with antiVEGF drugs.BMC Geriatr. 2017; 17: 233
- Subjective expectations regarding length and health-related quality of life in Hungary: results from an empirical investigation.Health Expect. 2014; 17: 696-709
- European value of a quality adjusted life year — final publishable report.(Newcastle)2010
- Valuation of EQ-5D health states in Poland: first TTO-based social value set in Central and Eastern Europe.Value Health. 2010; 13: 289-297
- General population reference values for 3-level EQ-5D (EQ-5D-3L) questionnaire in Poland.Pol Arch Med Wewn. 2015; 125: 18-26
- Validity of EQ-5D-5L in stroke.Qual Life Res. 2015; 24: 845-850
- Pos14 quality of life according to EQ-5D after osteoporotic hip fracture in Poland.Value Health. 2006; 9: A382-A383
- Eq1 psychometric comparison of Eq-5d and Eq-5d-5l in student population.Value Health. 2010; 13: A240
- Valuation of EQ-5D-3l Health States in Slovenia: VAS based and TTO based value sets.Slovenian Journal of Public Health. 2019; 59: 8-17
- The Slovenian VAS tariff based on valuations of EQ-5D health states from the general population. EuroQol 17th Plenary Meeting.(Pamplona, Spain)2000
- Use of the EQ-5D instrument and value scale in comparing health states of patients in four health care programs among health care providers.Value Health Reg Issues. 2014; 4: 95-99
- Health-related productivity of the Hungarian population. A cross-sectional survey.Orv Hetil. 2020; 161: 1522-1533
- Self-Reported Population Health: An International Perspective Based on EQ-5D. Springer, Dordrecht, The Netherlands2014
- Modeling valuations for EuroQol health states.Med Care. 1997; 35: 1095-1108
- Clinical applications of visual analogue scales: a critical review.Psychol Med. 1988; 18: 1007-1019
- Mini-mental state.J Psychiatr Res. 1975; 12: 189-198
- Rating neurologic impairment in multiple sclerosis: an expanded disability status scale (EDSS).Neurology. 1983; 33: 1444-1452
- Severe psoriasis—oral therapy with a new retinoid.Dermatologica. 1978; 157: 238-244
- Parallel valuation of the EQ-5D-3L and EQ-5D-5L by time trade-off in Hungary.Value Health. 2020; 23: 1235-1245
- International Statistical Classification of Diseases and Related Health Problems 10th Revision.https://icd.who.int/browse10/2016/ensDate accessed: January 5, 2019
- Assessing the performance of the EQ-VAS in the NHS Proms programme.Qual Life Res. 2014; 23: 977-989
- Happiness in Nations: Overview of Happiness Surveys Using Measure Type. Erasmus University Rotterdam, Happiness Economics Research Organisation.http://worlddatabaseofhappiness.eur.nlDate accessed: March 20, 2019
- Glossary: Minimum European Health Module (MEHM). Eurostat.https://ec.europa.eu/eurostat/statistics-explained/index.php/Glossary:Minimum_European_Health_Module_(MEHM)Date accessed: February 24, 2019
- The reliability of the Minimum European Health Module.Int J Public Health. 2009; 54: 55-60
- The dimensions of health outcomes: the health assessment questionnaire, disability and pain scales.J Rheumatol. 1982; 9: 789-793
- Dermatology Life Quality Index (DLQI)—a simple practical measure for routine clinical use.Clin Exp Dermatol. 1994; 19: 210-216
- Functional evaluation: the Barthel index.Md State Med J. 1965; 14: 61-65
- A functional recovery score for elderly hip fracture patients: I. Development.J Orthop Trauma. 2000; 14: 20-25
- Development and validation of the Bladder Cancer Index: a comprehensive, disease specific measure of health related quality of life in patients with localized bladder cancer.J Urol. 2010; 183: 1764-1769
- The American Urological Association symptom index for benign prostatic hyperplasia.J Urol. 1992; 148: 1549-1557
Chen T, Guestrin C. XGBoost: a scalable tree boosting system. Paper presented at: KDD’16: 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; New York; 2016.
- A comparative analysis of gradient boosting algorithms.Artif Intell Rev. 2020; 54: 1937-1967
- Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance.Clim Res. 2005; 30: 79-82
- Comparison of the minimally important difference for two health state utility measures: EQ-5D and SF-6D.Qual Life Res. 2005; 14: 1523-1532
- What determines the shape of an EQ-5D index distribution?.Med Decis Mak. 2016; 36: 941-951
- A comparison of the EQ-5D and SF-6D across seven patient groups.Health Econ. 2004; 13: 873-884
- Principles of Data Mining.Springer, London, United Kingdom2007
- Artificial intelligence and algorithmic bias: implications for health systems.J Glob Health. 2019; 9010318
- Evaluation of machine learning algorithms for health and wellness applications: a tutorial.Comput Biol Med. 2021; 132104324
- Predicting patient-reported outcomes following hip and knee replacement surgery using supervised machine learning.BMC Med Inform Decis Mak. 2019; 19: 3
- Prediction model for health-related quality of life of elderly with chronic diseases using machine learning techniques.Healthc Inform Res. 2014; 20: 125-134
- Markov blanket-based approach for learning multi-dimensional Bayesian network classifiers: an application to predict the European Quality of Life-5 Dimensions (EQ-5D) from the 39-item Parkinson’s Disease Questionnaire (PDQ-39).J Biomed Inform. 2012; 45: 1175-1184
- Use of large-scale HRQoL datasets to generate individualised predictions and inform patients about the likely benefit of surgery.Qual Life Res. 2017; 26: 2497-2505
- Mapping MacNew Heart Disease Quality of Life Questionnaire onto country-specific EQ-5D-5L utility scores: a comparison of traditional regression models with a machine learning technique.Eur J Health Econ. 2021; 22: 341-350
- Validity of the EQ-5D-5L questionnaire among the general population of Poland.Qual Life Res. 2021; 30: 817-829
- Tails from the peak district: adjusted limited dependent variable mixture models of EQ-5D questionnaire health state utility values.Value Health. 2012; 15: 550-561
- Missing covariate data in clinical research: when and when not to use the missing-indicator method for analysis.CMAJ. 2012; 184: 1265-1269
- Missing data should be handled differently for prediction than for description or causal explanation.J Clin Epidemiol. 2020; 125: 183-187
Song M, Zhou X, Pazaris M, Spiegelman D. The missing covariate indicator method is nearly valid almost always. Preprint. Posted online October 30, 2021. ArXiv:211100138. https://doi.org/10.48550/arXiv.2111.00138.
- XGBoost in handling missing values for life insurance risk prediction.SN Appl Sci. 2020; 2: 1
- Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls.BMJ. 2009; 338: b2393
- Modeling of missing data imputation using additive lasso regression model in Microsoft Azure.J Eng Appl Sci. 2018; 13: 6324-6334
- A review of studies mapping (or cross walking) non-preference based measures of health to generic preference-based measures.Eur J Health Econ. 2010; 11: 215-225
- A US population health survey on the impact of COVID-19 using the EQ-5D-5L.J Gen Intern Med. 2021; 36: 1292-1301
Article info
Publication history
Identification
Copyright
User license
Creative Commons Attribution (CC BY 4.0) |
Permitted
- Read, print & download
- Redistribute or republish the final article
- Text & data mine
- Translate the article
- Reuse portions or extracts from the article in other works
- Sell or re-use for commercial purposes
Elsevier's open access license policy