Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
ChatGPT & clinical decision-making: a cross-sectional study on Italian Medical Residency Test
7
Zitationen
6
Autoren
2023
Jahr
Abstract
Abstract Background The increasing application of artificial intelligence (AI) in healthcare offers significant opportunities for improving clinical decision-making and medical education. This study assesses the performance of ChatGPT, a large language model (LLM), on the Italian State Exam for Medical Residency (SSM) test, to evaluate its potential as a tool for medical education and clinical decision-making support. Given the high stakes of medical decision-making, understanding the capabilities of AI tools like ChatGPT in this context is crucial. Methods A cross-sectional study design was employed, analyzing 136 questions from the official SSM test. ChatGPT's responses were compared to the performance of medical doctors who took the test in 2022. Questions were classified into clinical cases (CC) and notional questions (NQ). The study assessed ChatGPT's overall accuracy, performance on CC and NQ, and compared its results with participating medical doctors. Results ChatGPT achieved an overall accuracy of 90.44%, with higher performance on clinical cases (92.45%) compared to notional questions (89.15%). ChatGPT's performance was higher than 99.6% of the medical doctor participants. Conclusions Based on the data, ChatGPT shows promise as a useful tool in clinical decision-making, particularly in clinical reasoning. This study recommends further research to explore the potential applications and implementation of LLMs in medical education and medical practice, as well as the initiation of procedures and policies for their safe and effective integration into healthcare systems. Key messages • ChatGPT outperforms 99.6% of medical doctors in the Italian State Exam for Medical Residency, demonstrating potential as a clinical decision-making tool. • Further research is needed to explore LLM implementation in medical education and practice, and to develop policies for safe AI integration.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.336 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.207 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.607 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.476 Zit.