Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Hidden flaws behind expert-level accuracy of multimodal GPT-4 vision in medicine
2
Zitationen
18
Autoren
2024
Jahr
Abstract
(NEJM) Image Challenges - an imaging quiz designed to test the knowledge and diagnostic capabilities of medical professionals. Evaluation results confirmed that GPT-4V performs comparatively to human physicians regarding multi-choice accuracy (81.6% vs. 77.8%). GPT-4V also performs well in cases where physicians incorrectly answer, with over 78% accuracy. However, we discovered that GPT-4V frequently presents flawed rationales in cases where it makes the correct final choices (35.5%), most prominent in image comprehension (27.2%). Regardless of GPT-4V's high accuracy in multi-choice questions, our findings emphasize the necessity for further in-depth evaluations of its rationales before integrating such multimodal AI models into clinical workflows.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.697 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.602 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.127 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.872 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.
Autoren
Institutionen
- National Institutes of Health(US)
- United States National Library of Medicine(US)
- University of Pittsburgh(US)
- Cornell University(US)
- Weill Cornell Medicine(US)
- New York University(US)
- Harvard University(US)
- Massachusetts General Hospital(US)
- National Institutes of Health Clinical Center(US)
- Southwestern Medical Center
- The University of Texas Southwestern Medical Center(US)
- MetroHealth Medical Center(US)
- University of California San Diego(US)
- University of Arkansas for Medical Sciences(US)
- University of Pittsburgh Medical Center(US)
- National Eye Institute(US)