Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Accuracy of AI Tools in the Diagnosis of Benign, Potentially Malignant and Malignant Oral Lesions: A Pilot Study
0
Zitationen
13
Autoren
2026
Jahr
Abstract
Background: Artificial intelligence (AI) is expected to play an increasingly important role in medicine and dentistry. While its diagnostic potential has been tested in various medical fields, limited research exists on its applications within oral medicine diagnoses using clinical images. Objective: This pilot study aimed to evaluate the diagnostic accuracy of ChatGPT, Gemini, and Copilot in identifying benign, potentially malignant, and malignant oral lesions. Methods: A cross-sectional study was conducted using clinical images from three categories: benign oromucosal conditions, oral potentially malignant disorders, and malignant oral lesions. Results: ChatGPT evaluated all images and consistently outperformed Copilot—and in some cases Gemini—across multiple diagnostic questions, with statistically significant advantages particularly in the cancer subgroup. Copilot showed the weakest performance, with high rates of missing evaluations and significantly lower proportions of correct responses in several analyses. Across both full-dataset and adjusted analyses, ChatGPT demonstrated the highest diagnostic performance overall. Diagnostic accuracy metrics for malignancy suspicion was similar for ChatGPT and Gemini. Several limitations such as sample size, lack of reproducibility testing and inability of some AI models to process images must be taken into account when interpreting the results. Conclusions: AI tools show promise but cannot yet replace clinical expertise. Further research and development are needed to improve the accuracy and applicability of AI diagnostic tools.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.391 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.257 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.685 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.501 Zit.