Effects of a Differential Diagnosis List of Artificial Intelligence on Differential Diagnoses by Physicians: An Exploratory Analysis of Data from a Randomized Controlled Study
Abstract
1. Introduction
2. Materials and Methods
2.1. Study Design, Participants, Materials, and Intervention
2.2. Data Collection and Outcomes
2.3. Statistical Methods
3. Results
3.1. Baseline Characteristics of Physicians Who Participated in the Study
3.2. The Number of Physician Diagnoses
3.3. Primary Outcome
3.4. Subgroup Analysis
4. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Committee on Diagnostic Error in Health Care; Board on Health Care Services; Institute of Medicine. The National Academies of Sciences, Engineering, and Medicine Improving Diagnosis in Health Care; Balogh, E.P., Miller, B.T., Ball, J.R., Eds.; National Academies Press: Washington, DC, USA, 2015; p. 21794. ISBN 978-0-309-37769-0.
- Leeds, F.S.; Atwa, K.M.; Cook, A.M.; Conway, K.A.; Crawford, T.N. Teaching Heuristics and Mnemonics to Improve Generation of Differential Diagnoses. Med. Educ. Online 2020, 25, 1742967. [Google Scholar] [CrossRef] [PubMed]
- Müller, L.; Gangadharaiah, R.; Klein, S.C.; Perry, J.; Bernstein, G.; Nurkse, D.; Wailes, D.; Graham, R.; El-Kareh, R.; Mehta, S.; et al. An Open Access Medical Knowledge Base for Community Driven Diagnostic Decision Support System Development. BMC Med. Inform. Decis. Mak. 2019, 19, 93. [Google Scholar] [CrossRef] [PubMed]
- Krupat, E.; Wormwood, J.; Schwartzstein, R.M.; Richards, J.B. Avoiding Premature Closure and Reaching Diagnostic Accuracy: Some Key Predictive Factors. Med. Educ. 2017, 51, 1127–1137. [Google Scholar] [CrossRef] [PubMed]
- Shimizu, T.; Matsumoto, K.; Tokuda, Y. Effects of the Use of Differential Diagnosis Checklist and General De-Biasing Checklist on Diagnostic Performance in Comparison to Intuitive Diagnosis. Med. Teach. 2013, 35, e1218–e1229. [Google Scholar] [CrossRef] [PubMed]
- Ramnarayan, P.; Cronje, N.; Brown, R.; Negus, R.; Coode, B.; Moss, P.; Hassan, T.; Hamer, W.; Britto, J. Validation of a Diagnostic Reminder System in Emergency Medicine: A Multi-Centre Study. Emerg. Med. J. 2007, 24, 619–624. [Google Scholar] [CrossRef]
- Bond, W.F.; Schwartz, L.M.; Weaver, K.R.; Levick, D.; Giuliano, M.; Graber, M.L. Differential Diagnosis Generators: An Evaluation of Currently Available Computer Programs. J. Gen. Intern. Med. 2012, 27, 213–219. [Google Scholar] [CrossRef] [PubMed]
- Riches, N.; Panagioti, M.; Alam, R.; Cheraghi-Sohi, S.; Campbell, S.; Esmail, A.; Bower, P. The Effectiveness of Electronic Differential Diagnoses (DDX) Generators: A Systematic Review and Meta-Analysis. PLoS ONE 2016, 11, e0148991. [Google Scholar] [CrossRef]
- Martinez-Franco, A.I.; Sanchez-Mendiola, M.; Mazon-Ramirez, J.J.; Hernandez-Torres, I.; Rivero-Lopez, C.; Spicer, T.; Martinez-Gonzalez, A. Diagnostic Accuracy in Family Medicine Residents Using a Clinical Decision Support System (DXplain): A Randomized-Controlled Trial. Diagnosis 2018, 5, 71–76. [Google Scholar] [CrossRef]
- Schwitzguebel, A.J.-P.; Jeckelmann, C.; Gavinio, R.; Levallois, C.; Benaïm, C.; Spechbach, H. Differential Diagnosis Assessment in Ambulatory Care With an Automated Medical History–Taking Device: Pilot Randomized Controlled Trial. JMIR Med. Inform. 2019, 7, e14044. [Google Scholar] [CrossRef] [PubMed]
- Friedman, C.P.; Elstein, A.S.; Wolf, F.M.; Murphy, G.C.; Franz, T.M.; Heckerling, P.S.; Fine, P.L.; Miller, T.M.; Abraham, V. Enhancement of Clinicians’ Diagnostic Reasoning by Computer-Based Consultation: A Multisite Study of 2 Systems. JAMA 1999, 282, 1851–1856. [Google Scholar] [CrossRef]
- Kostopoulou, O.; Sirota, M.; Round, T.; Samaranayaka, S.; Delaney, B.C. The Role of Physicians’ First Impressions in the Diagnosis of Possible Cancers without Alarm Symptoms. Med. Decis. Mak. 2017, 37, 9–16. [Google Scholar] [CrossRef]
- McLaughlin, K.; Heemskerk, L.; Herman, R.; Ainslie, M.; Rikers, R.M.; Schmidt, H.G. Initial Diagnostic Hypotheses Bias Analytic Information Processing in Non-Visual Domains. Med. Educ. 2008, 42, 496–502. [Google Scholar] [CrossRef] [PubMed]
- Kostopoulou, O.; Rosen, A.; Round, T.; Wright, E.; Douiri, A.; Delaney, B. Early Diagnostic Suggestions Improve Accuracy of GPs: A Randomised Controlled Trial Using Computer-Simulated Patients. Br. J. Gen. Pract. 2015, 65, e49–e54. [Google Scholar] [CrossRef] [PubMed]
- Kostopoulou, O.; Lionis, C.; Angelaki, A.; Ayis, S.; Durbaba, S.; Delaney, B.C. Early Diagnostic Suggestions Improve Accuracy of Family Physicians: A Randomized Controlled Trial in Greece. Fam. Pract. 2015, 32, 323–328. [Google Scholar] [CrossRef]
- Kostopoulou, O.; Porat, T.; Corrigan, D.; Mahmoud, S.; Delaney, B.C. Diagnostic Accuracy of GPs When Using an Early-Intervention Decision Support System: A High-Fidelity Simulation. Br. J. Gen. Pract. 2017, 67, e201–e208. [Google Scholar] [CrossRef]
- Harada, Y.; Katsukura, S.; Kawamura, R.; Shimizu, T. Efficacy of Artificial-Intelligence-Driven Differential-Diagnosis List on the Diagnostic Accuracy of Physicians: An Open-Label Randomized Controlled Study. Int. J. Environ. Res. Public Health 2021, 18, 2086. [Google Scholar] [CrossRef] [PubMed]
- Wolf, F.M.; Friedman, C.P.; Elstein, A.S.; Miller, J.G.; Murphy, G.C.; Heckerling, P.; Fine, P.; Miller, T.; Sisson, J.; Barlas, S.; et al. Changes in Diagnostic Decision-Making after a Computerized Decision Support Consultation Based on Perceptions of Need and Helpfulness: A Preliminary Report. Proc. AMIA Annu. Fall. Symp. 1997, 263–267. [Google Scholar]
- Berner, E.S.; Maisiak, R.S.; Heuderbert, G.R.; Young, K.R. Clinician Performance and Prominence of Diagnoses Displayed by a Clinical Diagnostic Decision Support System. AMIA Annu. Symp. Proc. 2003, 2003, 76–80. [Google Scholar]
- Wickens, C.D.; Dixon, S.R. The Benefits of Imperfect Diagnostic Automation: A Synthesis of the Literature. Theor. Issues Ergon. Sci. 2007, 8, 201–212. [Google Scholar] [CrossRef]
- Goddard, K.; Roudsari, A.; Wyatt, J.C. Automation Bias: Empirical Results Assessing Influencing Factors. Int. J. Med. Inform. 2014, 83, 368–375. [Google Scholar] [CrossRef]
- Dreiseitl, S.; Binder, M. Do Physicians Value Decision Support? A Look at the Effect of Decision Support Systems on Physician Opinion. Artif. Intell. Med. 2005, 33, 25–30. [Google Scholar] [CrossRef] [PubMed]
- Lee, J.D.; See, K.A. Trust in Automation: Designing for Appropriate Reliance. Hum. Factors 2004, 46, 50–80. [Google Scholar] [CrossRef] [PubMed]
- Cabitza, F.; Campagner, A.; Sconfienza, L.M. As If Sand Were Stone. New Concepts and Metrics to Probe the Ground on Which to Build Trustable AI. BMC Med. Inform. Decis. Mak. 2020, 20. [Google Scholar] [CrossRef] [PubMed]
- Bruckert, S.; Finzel, B.; Schmid, U. The Next Generation of Medical Decision Support: A Roadmap Toward Transparent Expert Companions. Front. Artif. Intell. 2020, 3. [Google Scholar] [CrossRef] [PubMed]
| With AI Differential List | Without AI Differential List | p Value | |
|---|---|---|---|
| Total | 490/528 (92.8%) | 485/528 (91.9%) | 0.64 | 
| The rank of physician diagnosis | |||
| 1 | 176/176 (100%) | 176/176 (100%) | >0.99 | 
| 2 | 173/176 (98.3%) | 172/176 (97.7%) | >0.99 | 
| 3 | 141/176 (80.1%) | 137/176 (77.8%) | 0.69 | 
| Sex | |||
| Male | 313/336 (93.2%) | 401/432 (92.8%) | 0.97 | 
| Female | 177/192 (92.2%) | 84/96 (87.5%) | 0.28 | 
| Experience | |||
| Intern | 135/144 (93.8%) | 80/96 (83.3%) | 0.02 | 
| Resident | 176/192 (91.7%) | 185/192 (96.4%) | 0.09 | 
| Attending physician | 179/192 (93.2%) | 220/240 (91.7%) | 0.67 | 
| Trust in AI | |||
| Yes | 268/288 (93.1%) | 311/336 (92.6%) | 0.93 | 
| No | 222/240 (92.5%) | 174/192 (90.6%) | 0.60 | 
| AI correctness | |||
| AI correct | 246/264 (93.2%) | 237/264 (89.8%) | 0.21 | 
| AI incorrect | 244/264 (92.4%) | 248/264 (93.9%) | 0.60 | 
| With AI Differential List | Without AI Differential List | p Value | |
|---|---|---|---|
| The rank of physician diagnosis | |||
| 1 | 141/176 (80.1%) | 123/176 (69.9%) | 0.04 | 
| 2 | 116/173 (67.1%) | 85/172 (49.4%) | 0.001 | 
| 3 | 87/141 (61.7%) | 59/137 (43.1%) | 0.003 | 
| AI correctness | |||
| AI correct | 207/246 (84.1%) | 168/237 (70.9%) | <0.001 | 
| AI incorrect | 137/244 (56.1%) | 99/248 (39.9%) | <0.001 | 
| With AI Differential List | Without AI Differential List | p Value | |
|---|---|---|---|
| Sex | |||
| Male | 230/313 (73.5%) | 224/401 (55.9%) | <0.001 | 
| Female | 114/177 (64.4%) | 43/84 (51.2%) | 0.06 | 
| Experience | |||
| Intern | 107/135 (79.3%) | 42/80 (52.5%) | <0.001 | 
| Resident | 123/176 (69.9%) | 101/185 (54.6%) | 0.004 | 
| Attending physician | 114/179 (63.7%) | 124/220 (56.4%) | 0.17 | 
| Trust in AI | |||
| Yes | 209/268 (78.0%) | 171/311 (55.0%) | <0.001 | 
| No | 135/222 (60.8%) | 96/174 (55.2%) | 0.30 | 
| Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. | 
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Harada, Y.; Katsukura, S.; Kawamura, R.; Shimizu, T. Effects of a Differential Diagnosis List of Artificial Intelligence on Differential Diagnoses by Physicians: An Exploratory Analysis of Data from a Randomized Controlled Study. Int. J. Environ. Res. Public Health 2021, 18, 5562. https://doi.org/10.3390/ijerph18115562
Harada Y, Katsukura S, Kawamura R, Shimizu T. Effects of a Differential Diagnosis List of Artificial Intelligence on Differential Diagnoses by Physicians: An Exploratory Analysis of Data from a Randomized Controlled Study. International Journal of Environmental Research and Public Health. 2021; 18(11):5562. https://doi.org/10.3390/ijerph18115562
Chicago/Turabian StyleHarada, Yukinori, Shinichi Katsukura, Ren Kawamura, and Taro Shimizu. 2021. "Effects of a Differential Diagnosis List of Artificial Intelligence on Differential Diagnoses by Physicians: An Exploratory Analysis of Data from a Randomized Controlled Study" International Journal of Environmental Research and Public Health 18, no. 11: 5562. https://doi.org/10.3390/ijerph18115562
APA StyleHarada, Y., Katsukura, S., Kawamura, R., & Shimizu, T. (2021). Effects of a Differential Diagnosis List of Artificial Intelligence on Differential Diagnoses by Physicians: An Exploratory Analysis of Data from a Randomized Controlled Study. International Journal of Environmental Research and Public Health, 18(11), 5562. https://doi.org/10.3390/ijerph18115562
 
         
                                                


 
       