Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Quality of Information Provided by Artificial Intelligence Chatbots Surrounding the Management of Vestibular Schwannomas: A Comparative Analysis Between ChatGPT-4 and Claude 2
3
Zitationen
26
Autoren
2025
Jahr
Abstract
OBJECTIVE: To examine the quality of information provided by artificial intelligence platforms ChatGPT-4 and Claude 2 surrounding the management of vestibular schwannomas. STUDY DESIGN: Cross-sectional. SETTING: Skull base surgeons were involved from different centers and countries. INTERVENTION: Thirty-six questions regarding vestibular schwannoma management were tested. Artificial intelligence responses were subsequently evaluated by 19 lateral skull base surgeons using the Quality Assessment of Medical Artificial Intelligence (QAMAI) questionnaire, assessing "Accuracy," "Clarity," "Relevance," "Completeness," "Sources," and "Usefulness." MAIN OUTCOME MEASURE: The scores of the answers from both chatbots were collected and analyzed using the Student t test. Analysis of responses grouped by stakeholders was performed with McNemar test. Stuart-Maxwell test was used to compare reading level among chatbots. Intraclass correlation coefficient was calculated. RESULTS: ChatGPT-4 demonstrated significantly improved quality over Claude 2 in 14 of 36 (38.9%) questions, whereas higher-quality scores for Claude 2 were only observed in 2 (5.6%) answers. Chatbots exhibited variation across the dimensions of "Accuracy," "Clarity," "Completeness," "Relevance," and "Usefulness," with ChatGPT-4 demonstrating a statistically significant superior performance. However, no statistically significant difference was found in the assessment of "Sources." Additionally, ChatGPT-4 provided information at a significant lower reading grade level. CONCLUSIONS: Artificial intelligence platforms failed to consistently provide accurate information surrounding the management of vestibular schwannoma, although ChatGPT-4 achieved significantly higher scores in most analyzed parameters. These findings demonstrate the potential for significant misinformation for patients seeking information through these platforms.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.719 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.628 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.176 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.880 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.
Autoren
- Daniele Borsetto
- Egidio Sia
- Patrick Axon
- Neil Donnelly
- James R. Tysome
- Lukas Anschuetz
- Danièle Bernardeschi
- Vincenzo Capriotti
- Per Cayé‐Thomasen
- Niels West
- Isaac D. Erbele
- Sebastiano Franchella
- Annalisa Gatto
- Jeanette Hess-Erga
- Henricus P. M. Kunst
- John P. Marinelli
- Richard Mannion
- Benedict Panizza
- Franco Trabalzini
- Rupert Obholzer
- Luigi Angelo Vaira
- Jerry Polesel
- Fabiola Giudici
- Matthew L. Carlson
- Giancarlo Tirelli
- Paolo Boscolo‐Rizzo
Institutionen
- Cambridge University Hospitals NHS Foundation Trust(GB)
- University of Trieste(IT)
- University of Bern(CH)
- University Hospital of Bern(CH)
- Sorbonne Université(FR)
- Ospedale di San Donà di Piave(IT)
- Copenhagen University Hospital(DK)
- Rigshospitalet(DK)
- Brooke Army Medical Center(US)
- Joint Base San Antonio(US)
- University of Padua(IT)
- Haukeland University Hospital(NO)
- Radboud University Nijmegen(NL)
- Radboud University Medical Center(NL)
- Mayo Clinic(US)
- University of Cambridge(GB)
- Addenbrooke's Hospital(GB)
- Princess Alexandra Hospital(AU)
- Meyer Children's Hospital(IT)
- National Hospital for Neurology and Neurosurgery(GB)
- University College London(GB)
- University of Sassari(IT)
- Centro di Riferimento Oncologico(IT)
- Istituti di Ricovero e Cura a Carattere Scientifico(IT)