Highlights
- •Artificial intelligence (AI) is increasingly being used in clinical applications. Nevertheless, its flexibility and difficulties around collecting data on its clinical impacts make value assessment challenging.
- •We use a value framework as the basis for assessing how AI may create value depending on how it is used. We also provide advice to health economists seeking to model AI’s clinical impacts.
- •There are multiple ways that AI challenges traditional health technology assessment methodology. We highlight a number of ways that health technology assessment methods will need to be further refined to accommodate AI.
Abstract
Objectives
Methods
Results
Conclusions
Keywords
Introduction
Stinton C, Jenkinson D, Adekanmbi V, Clarke A, Taylor-Phillips S. Does time of day influence cancer detection and recall rates in mammography? In: Proceeding from the SPIE 10136, Medical Imaging 2017: Image Perception, Observer Performance, and Technology Assessment; March 10, 2017; Orlando, FL. Abstract 10136B.

Evidence Creation and Generalizability
Challenge | Opportunity |
---|---|
Most evidence on clinical AI performance comes from retrospective studies | Outcome modeling provides methods for assessing clinical impact from limited evidence such as surrogate outcomes |
AI may perpetuate inequities in healthcare | A critical perspective on data and outcomes used and subgroup analysis can reveal the potential group harms of AI |
AI performance is often compared with clinician performance in unrealistic ways | Simulation methods can estimate outcomes in different settings and identify the types of collaboration between clinicians and AI for further research |
A single AI algorithm can be used at many different thresholds of sensitivity and specificity | Cost-effectiveness can be used to select thresholds that optimize value |
AI impacts on clinician productivity are uncertain | Sensitivity analysis can be used to explore how productivity impacts would affect the choice of how to implement AI |
Technological improvements and access to more data mean that AI will likely improve over time | Dynamic models, perhaps similar to those used for infectious disease modeling, can be expanded to simulate improved performance over time |
Coverage for AI is still very unclear | Cost-effectiveness modeling can be used to test different ways that coverage decisions can be used to incentivize appropriate and equitable use of AI |
Integration of AI Into Clinician Work

Speed and Real-Time Technological Innovation
Morey JR, Fiano E, Yaeger KA, Zhang X, Fifi JT. Impact of viz LVO on time-to-treatment and clinical outcomes in large vessel occlusion stroke patients presenting to primary stroke centers. Preprint. Posted online July 5, 2020. medRxiv 2020.07.02.20143834. https://doi.org/10.1101/2020.07.02.20143834.
Costs
Conclusions
Use case | Explanation | Examples | Potential sources of value | Special considerations |
---|---|---|---|---|
Enhance clinical possibilities | Performs tasks that are not possible for human clinicians or performs them with greater speed and accuracy than humans can |
|
|
|
Extend clinician expertise | Replicates the work of human clinicians to improve their accessibility, as in performing specialist tasks in a primary care office or clinical tasks outside of clinic, e.g., via smartphone |
|
|
|
Automate clinician work | Replaces human tasks in order to reduce clinician burden or enhance efficiency |
|
|
|
Article and Author Information
Acknowledgment
References
- Interview with Colin Garvey, Rensselaer Polytechnic Institute. Artificial intelligence and systems medicine convergence.Omics. 2018; 22: 130-132
- Introduction.in: Agrawal A. Gans J. Goldfarb A. The Economics of Artificial Intelligence. University of Chicago Press, Chicago, IL2019
- Computer-assisted diagnostic decision support: history, challenges, and possible paths forward.Adv Health Sci Educ Theory Pract. 2009; 14: 89-106
- High-performance medicine: the convergence of human and artificial intelligence.Nat Med. 2019; 25: 44-56
- Beyond prediction: using big data for policy problems.Science. 2017; 355: 483-485
Stinton C, Jenkinson D, Adekanmbi V, Clarke A, Taylor-Phillips S. Does time of day influence cancer detection and recall rates in mammography? In: Proceeding from the SPIE 10136, Medical Imaging 2017: Image Perception, Observer Performance, and Technology Assessment; March 10, 2017; Orlando, FL. Abstract 10136B.
- Combining mobile-health (mHealth) and artificial intelligence (AI) methods to avoid suicide attempts: the Smartcrises study protocol.BMC Psychiatry. 2019; 19: 277
- Translating artificial intelligence into clinical care.JAMA. 2016; 316: 2368-2369
- Artificial intelligence in the medical system: four roles for potential transformation. Preprint. Posted online March 18, 2019. University of Michigan Public Law Research Paper No. 631.(Accessed December 11, 2020)
- CheXNet: radiologist-level pneumonia detection on chest X-rays with deep learning. Preprint. Posted online November 14, 2017. arXiv:1711.05225.(Accessed December 11, 2020)
- Large-scale assessment of a smartwatch to identify atrial fibrillation.N Engl J Med. 2019; 381: 1909-1917
- Validation of automated screening for referable diabetic retinopathy with the IDx-DR device in the Hoorn Diabetes Care System.Acta Ophthalmol. 2018; 96: 63-68
- How should artificial intelligence screen for skin cancer and deliver diagnostic predictions to patients?.JAMA Dermatol. 2018; 154: 1383-1384
- Superior skin cancer classification by the combination of human and artificial intelligence.Eur J Cancer. 2019; 120: 114-121
- Reimagining clinical documentation with artificial intelligence.Mayo Clin Proc. 2018; 93: 563-565
- Prospective deployment of deep learning in MRI: a framework for important considerations, challenges, and recommendations for best practices.J Magn Reson Imaging. 2021; 54: 357-371
- Development of a taxonomy for health information technology.Stud Health Technol Inform. 2007; 129: 616-620
- Artificial intelligence in healthcare: past, present and future.Stroke Vasc Neurol. 2017; 2: 230-243
- Clinical information systems and artificial intelligence: recent research trends.Yearb Med Inform. 2019; 28: 83-94
- Models in the development of clinical practice guidelines.Ann Intern Med. 2015; 162: 530-531
- The ethics of AI in health care: a mapping review.Soc Sci Med. 2020; 260113172
Ostherr K. Artificial intelligence and medical humanities [published online July 11, 2020]. J Med Humanit. https://doi.org/10.1007/s10912-020-09636-4.
- An ethically mindful approach to AI for health care.Lancet. 2020; 395: 254-255
- Defining elements of value in health care—a health economics approach: an ISPOR Special Task Force Report:[3].Value Health. 2018; 21: 131-139
- Artificial intelligence versus clinicians: systematic review of design, reporting standards, and claims of deep learning studies.BMJ. 2020; 368: m689
- General wellness: policy for low risk devices; guidance for Industry and Food and Drug Administration staff; availability.Fed Regist. 2016; 81: 49993-49995
- Lifecycle regulation of artificial intelligence--and machine learning--based software devices in medicine.JAMA. 2019; 322: 2285-2286
- Direct-to-consumer medical machine learning and artificial intelligence applications.Nat Mach Intell. 2021; 3: 283-287
- How medical AI devices are evaluated: limitations and recommendations from an analysis of FDA approvals.Nat Med. 2021; 27: 582-584
- The proof of the pudding: in praise of a culture of real-world validation for medical artificial intelligence.Ann Transl Med. 2019; 7: 161
- Investigating the impacting factors for the healthcare professionals to adopt artificial intelligence-based medical diagnosis support system (AIMDSS).Ann Oper Res. 2020; 294: 567-592
- Adoption of artificial intelligence and machine learning is increasing, but irrational exuberance remains.NEJM Catal. 2020; 1: 2-18
- Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: a cross-sectional study.PLoS Med. 2018; 15e1002683
- Artificial intelligence and the future of global health.Lancet. 2020; 395: 1579-1586
- Welcoming new guidelines for AI clinical research.Nat Med. 2020; 26: 1318-1320
Beede E, Baylor E, Hersch F, et al. A human-centered evaluation of a deep learning system deployed in Clinics for the Detection of Diabetic Retinopathy. In: Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems; Honolulu HI; April 21, 2020; 1-12.
- Dissecting racial bias in an algorithm used to manage the health of populations.Science. 2019; 366: 447-453
- A deep learning mammography-based model for improved breast cancer risk prediction.Radiology. 2019; 292: 60-66
- An algorithmic approach to reducing unexplained pain disparities in underserved populations.Nat Med. 2021; 27: 136-140
- Key principles of clinical validation, device approval, and insurance coverage decisions of artificial intelligence.Korean J Radiol. 2021; 22: 442-453
- Methodologic guide for evaluating clinical performance and effect of artificial intelligence technology for medical diagnosis and prediction.Radiology. 2018; 286: 800-809
- Machine learning for prediction in electronic health data.JAMA Netw Open. 2018; 1e181404
- Addressing bias in artificial intelligence in health care.JAMA. 2019; 322: 2377-2378
- External evaluation of 3 commercial artificial intelligence algorithms for independent assessment of screening mammograms.JAMA Oncol. 2020; 6: 1581-1588
- Evaluation of combined artificial intelligence and radiologist assessment to interpret screening mammograms [published correction appears in JAMA Netw Open. 2020;3(3):e204429].JAMA Netw Open. 2020; 3e200265
- Deep neural networks improve radiologists’ performance in breast cancer screening.IEEE Trans Med Imaging. 2020; 39: 1184-1194
- Evolution of insulin delivery devices: from syringes, pens, and pumps to DIY artificial pancreas.Diabetes Ther. 2020; 11: 1251-1269
- Prediction of melanoma evolution in melanocytic nevi via artificial intelligence: a call for prospective data [published correction appears in Eur J Cancer. 2019;123:171].Eur J Cancer. 2019; 119: 30-34
- Receiver operating characteristic (ROC) analysis: basic principles and applications in radiology.Eur J Radiol. 1998; 27: 88-94
- ROC analysis.in: Encyclopedia of Machine Learning and Data Mining. Springer, Boston, MA2016: 1-8
- International evaluation of an AI system for breast cancer screening.Nature. 2020; 577: 89-94
- Artificial intelligence, automation and work. National Bureau of Economic Research.(Accessed January 17, 2021)
- Mammographic breast density assessment using deep learning: clinical implementation.Radiology. 2019; 290: 52-58
- A model for types and levels of human interaction with automation.IEEE Trans Syst Man Cybern A Syst Hum. 2000; 30: 286-297
- Framing the challenges of artificial intelligence in medicine.BMJ Qual Saf. 2019; 28: 238-241
- Artificial intelligence in medical practice: the question to the answer?.Am J Med. 2018; 131: 129-133
- Deskilling in ophthalmology is the inevitable controllable?.Eye (Lond). 2019; 33: 347-348
- The role of trust in automation reliance.Int J Hum Comput Stud. 2003; 58: 697-718
- Anomalies: the endowment effect, loss aversion, and status quo bias.J Econ Perspect. 1991; 5: 193-206
- Communication practices of mammography facilities and timely follow-up of a screening mammogram with a BI-RADS 0 assessment.Acad Radiol. 2018; 25: 1118-1127
- Inadequate follow-up of abnormal screening mammograms: findings from the race differences in screening mammography process study (United States).Cancer Causes Control. 2005; 16: 809-821
- Black patients referred to a lung cancer screening program experience lower rates of screening and longer time to follow-up.BMC Cancer. 2020; 20: 561
- A study in transfer learning: leveraging data from multiple hospitals to enhance hospital-specific predictions.J Am Med Inform Assoc. 2014; 21: 699-706
- Transfer learning for clinical time series analysis using deep neural networks.J Healthc Inform Res. 2020; 4: 112-137
- Estimating dataset size requirements for classifying DNA microarray data.J Comput Biol. 2003; 10: 119-142
- Predicting sample size required for classification performance.BMC Med Inform Decis Mak. 2012; 12: 8
Morey JR, Fiano E, Yaeger KA, Zhang X, Fifi JT. Impact of viz LVO on time-to-treatment and clinical outcomes in large vessel occlusion stroke patients presenting to primary stroke centers. Preprint. Posted online July 5, 2020. medRxiv 2020.07.02.20143834. https://doi.org/10.1101/2020.07.02.20143834.
- Developing software precertification program: a working model. U.S. Food &Drug Administration.(Accessed December 11, 2020)
- Potential liability for physicians using artificial intelligence.JAMA. 2019; 322: 1765-1766
- Current state and near-term priorities for AI-enabled diagnostic support software in health care. Duke Margolis Center for Health Policy.(Accessed January 19, 2021)
- An overview of clinical decision support systems: benefits, risks, and strategies for success.NPJ Digit Med. 2020; 3: 17
- Artificial intelligence and liability in medicine: balancing safety and innovation.Milbank Q. 2021; 99: 629-647