Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Accuracy of ChatGPT‐Generated Information on Head and Neck and Oromaxillofacial Surgery: A Multicenter Collaborative Analysis
118
Zitationen
22
Autoren
2023
Jahr
Abstract
OBJECTIVE: To investigate the accuracy of Chat-Based Generative Pre-trained Transformer (ChatGPT) in answering questions and solving clinical scenarios of head and neck surgery. STUDY DESIGN: Observational and valuative study. SETTING: Eighteen surgeons from 14 Italian head and neck surgery units. METHODS: A total of 144 clinical questions encompassing different subspecialities of head and neck surgery and 15 comprehensive clinical scenarios were developed. Questions and scenarios were inputted into ChatGPT4, and the resulting answers were evaluated by the researchers using accuracy (range 1-6), completeness (range 1-3), and references' quality Likert scales. RESULTS: The overall median score of open-ended questions was 6 (interquartile range[IQR]: 5-6) for accuracy and 3 (IQR: 2-3) for completeness. Overall, the reviewers rated the answer as entirely or nearly entirely correct in 87.2% of cases and as comprehensive and covering all aspects of the question in 73% of cases. The artificial intelligence (AI) model achieved a correct response in 84.7% of the closed-ended questions (11 wrong answers). As for the clinical scenarios, ChatGPT provided a fully or nearly fully correct diagnosis in 81.7% of cases. The proposed diagnostic or therapeutic procedure was judged to be complete in 56.7% of cases. The overall quality of the bibliographic references was poor, and sources were nonexistent in 46.4% of the cases. CONCLUSION: The results generally demonstrate a good level of accuracy in the AI's answers. The AI's ability to resolve complex clinical scenarios is promising, but it still falls short of being considered a reliable support for the decision-making process of specialists in head-neck surgery.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.663 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.576 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.091 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.859 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.
Autoren
- Luigi Angelo Vaira
- Jérôme R. Lechien
- Vincenzo Abbate
- Fabiana Allevi
- Giovanni Audino
- Giada Anna Beltramini
- Michela Bergonzani
- Alessandro Bolzoni
- Umberto Committeri
- Salvatore Crimi
- Guido Gabriele
- F. Lonardi
- Fabio Maglitto
- Marzia Petrocelli
- Resi Pucci
- Gianmarco Saponaro
- Alessandro Tel
- Valentino Vellone
- Carlos M. Chiesa‐Estomba
- Paolo Boscolo‐Rizzo
- Giovanni Salzano
- Giacomo De Riu
Institutionen
- University of Sassari(IT)
- Zero to Three(US)
- University of Mons(BE)
- University of Naples Federico II(IT)
- AOL (United States)(US)
- University of Milan(IT)
- Ospedale San Paolo(IT)
- Fondazione IRCCS Ca' Granda Ospedale Maggiore Policlinico(IT)
- University of Parma(IT)
- University of Catania(IT)
- Policlinico Universitario di Catania(IT)
- University of Siena(IT)
- University of Verona(IT)
- University of Bari Aldo Moro(IT)
- Nini Hospital(LB)
- Carlo Forlanini Hospital(IT)
- Università Cattolica del Sacro Cuore(IT)
- University of Udine(IT)
- Santa Maria Nuova Hospital(IT)
- Biogipuzkoa Health Research Institute(ES)
- University of Trieste(IT)
- Sociedad Hispano Mundial(ES)