This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
ECOSBot: a multicenter validation pilot study of a generative AI tool for OSCE-based nephrology training
Citations: 3
Authors: 13
Year: 2025
Abstract
Background: Developing diagnostic reasoning in nephrology is particularly challenging due to its pathophysiological complexity and reliance on abstract clinical data. Objective Structured Clinical Examinations (OSCEs) are pivotal for nephrology training but remain resource-intensive and difficult to scale. Generative artificial intelligence (AI) offers a promising alternative, yet its capacity to emulate nephrology-specific OSCEs has not been formally assessed.
Methods: We developed ECOSBot, a web-based tool powered by GPT-4o, to simulate both standardized patients and examiners for nephrology-focused OSCEs. In this multicenter prospective study, undergraduate medical students from five French medical schools interacted with ECOSBot across four clinical stations. All interactions were double-rated by nephrology faculty members to establish a gold standard. ECOSBot's performance was evaluated against this standard using four criteria (script coverage, authenticity, correctness and relevance) for patient simulation, and via checklists and competency-based ratings for examiner scoring. Usability was assessed using the Chatbot Usability Questionnaire (CUQ), adapted to include six items on feedback quality.
Results: Ninety-one students generated 2939 prompts across 184 OSCE sessions. ECOSBot demonstrated high fidelity in patient simulation: authenticity 98.6% [95% confidence interval (CI) 98.2-99.0], correctness 98.3% (95% CI 97.9-98.7) and relevance 99.2% (95% CI 98.9-99.5), including during exchanges not explicitly covered by the pre-specified scenario. As an examiner, ECOSBot showed strong agreement with human raters on global scores [intraclass correlation coefficient (ICC) = 0.94, 95% CI 0.91-0.96], consistent across case formats, training levels and institutions. However, scoring of attitude and communication skills was less reliable (ICC = 0.44, 95% CI 0.28-0.58). Median CUQ score was 69.7/100, with 91.7% of students finding the tool highly useful for OSCE preparation in nephrology.
Conclusions: ECOSBot reliably simulated both roles in nephrology OSCEs with high fidelity and strong alignment with expert rating. While challenges remain for subjective skill assessment, this tool offers a scalable and autonomous solution to enhance nephrology education.
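The abstract reports examiner agreement as an intraclass correlation coefficient but does not say which ICC form was used. As a rough illustration only, the sketch below computes ICC(2,1) (two-way random effects, absolute agreement, single rater), which is one common choice for comparing an automated scorer with human raters. The function name `icc_2_1` and the toy scores are hypothetical and are not taken from the study.

```python
from typing import Sequence


def icc_2_1(scores: Sequence[Sequence[float]]) -> float:
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    scores[i][j] is the score given to session i by rater j.
    Assumption: this ICC form is chosen for illustration; the abstract
    does not state which form the authors used.
    """
    n = len(scores)       # number of rated sessions
    k = len(scores[0])    # number of raters (e.g. tool and human)
    if n < 2 or k < 2 or any(len(row) != k for row in scores):
        raise ValueError("need a complete n x k table with n >= 2 and k >= 2")

    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    # Two-way ANOVA sums of squares: sessions, raters, residual.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Toy global scores (made up, NOT study data): column 0 = tool, column 1 = human.
toy_scores = [[12, 13], [18, 17], [9, 10], [15, 15]]
toy_icc = icc_2_1(toy_scores)
```

Because this form measures absolute agreement, a rater who is systematically stricter or more lenient lowers the coefficient even when the ranking of sessions is identical. The function divides by zero if every score in the table is the same, so a real analysis would need to guard against that case and would also report a confidence interval.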
Similar works
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8,687 citations
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8,591 citations
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8,114 citations
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6,867 citations
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5,781 citations
Authors
Institutions
- Inserm (FR)
- Université de Lille (FR)
- Institut Pasteur de Lille (FR)
- Hôpitaux Universitaires de Strasbourg (FR)
- French Clinical Research Infrastructure Network (FR)
- Université de Strasbourg (FR)
- Lille’s Cardiology Hospital (FR)
- European Genomic Institute for Diabetes (FR)
- Université de Tours (FR)
- Université Paris Cité (FR)
- Centre Hospitalier Universitaire de Tours (FR)
- methodS in Patient-centered outcomes and HEalth ResEarch
- Centre François Baclesse (LU)
- Normandie Université (FR)
- Centre Hospitalier Universitaire de Caen Normandie (FR)
- Université de Caen Normandie (FR)
- Aix-Marseille Université (FR)
- Hôpital de la Conception (FR)
- Centre National de la Recherche Scientifique (FR)
- Centre Hospitalier Universitaire d'Angers (FR)
- Université d'Angers (FR)
- Centre Hospitalier Universitaire de Lille (FR)
- Hôpital Roger Salengro (FR)