Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Evaluating ChatGPT-5 for the Diagnosis of Benign Skin Lesions: Insights from Macroscopic and Dermoscopic Imaging
0
Zitationen
3
Autoren
2026
Jahr
Abstract
Background: While the potential of ChatGPT in the domain of medical diagnosis is noteworthy, the subject is intricate and has been examined in numerous studies across various medical disciplines. In this context, the objective of this study is to utilize ChatGPT-5 to evaluate its diagnostic accuracy for benign skin lesions using macroscopic and dermoscopic images. Methods: During the in-person examination, the dermatologist documented macroscopic and dermoscopic images of each of the 40 patients. These images, along with basic clinical information, were uploaded to ChatGPT-5. The evaluation process was meticulously structured into two distinct phases. In the initial phase, the presentation was limited to macroscopic images alone. In the subsequent phase, the presentation expanded to encompass both macroscopic and dermoscopic images. The model was tasked with making a preliminary diagnosis and, in the event of an inaccuracy, was expected to provide three differential diagnoses. The model's accuracy was assessed by comparing its diagnoses with the histopathological results. Results: In the evaluation conducted with ChatGPT-5, the diagnostic accuracy based solely on macroscopic images was 32.5%, whereas the accuracy for combined macroscopic and dermoscopic images decreased to 27.5% (p = 0.450). When three differential diagnoses were considered, the correct diagnosis was achieved in 48.1% of cases using macroscopic images, whereas this rate declined to 29.6% with the inclusion of dermoscopic images (p < 0.001). Conclusion: ChatGPT-5 demonstrated modest diagnostic accuracy for benign skin lesions, with performance declined when dermoscopic images were included. These results suggest that ChatGPT-5 should be considered a supportive aid rather than a standalone diagnostic tool.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.357 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.221 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.640 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.482 Zit.