Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Clinical decision support of advanced large language models in endodontic disease
1
Zitationen
8
Autoren
2025
Jahr
Abstract
Background/purpose: Large language models (LLMs) exhibit significant potential for clinical decision support, yet their application in endodontic disease remains underexplored. Materials and methods: This study assessed the decision-making capabilities of three advanced LLMs (GPT-4o, Claude 3.5, and Grok2) in specialized endodontic contexts. A question bank of 421 multiple-choice questions was constructed across 27 core endodontic topics, including theory, procedures, and 35 complex cases. The three LLMs were tested using standardized prompts, with performance evaluated via topic-stratified accuracy analysis. Results: Claude 3.5 achieved the highest overall accuracy (73.39 %), followed by Grok2 (66.27 %) and GPT-4o (46.32 %). Grok2 excelled in complex case analysis (69.57 %). The models performed strongly in theoretical domains (e.g., clinical examination, structural function, pharmacology) but showed limitations in complex scenarios and procedural techniques. Conclusion: LLMs hold promise as endodontic decision support tools, though domain-specific refinement is essential for effective clinical application.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.700 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.605 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.133 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.873 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.