This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
Grounding large language models in clinical diagnostics
Citations: 0
Authors: 19
Year: 2026
Abstract
Although Large Language Models (LLMs) possess extensive medical knowledge, they often struggle to emulate the complex, iterative process of real-world clinical diagnosis. To address this limitation, we present ClinDiag-GPT, a specialized LLM fine-tuned to execute full diagnostic procedures, supported by the ClinDiag-Framework evaluation system and ClinDiag-Benchmark, a dataset comprising 4,421 real-world cases. Our evaluation shows that existing LLMs, including GPT-4o-mini, GPT-4o, Claude-3-Haiku, Qwen2.5-72b, Qwen2.5-32b, and Qwen2.5-14b, while proficient in static tasks, fall short in dynamic diagnostic workflows and frequently commit clinical errors. In contrast, ClinDiag-GPT, trained on clinical cases, outperforms all baseline models in both diagnostic accuracy and procedural performance. Furthermore, a comparative analysis reveals that collaboration between physicians and ClinDiag-GPT yields higher diagnostic accuracy and efficiency than either working alone, demonstrating the utility of ClinDiag-GPT as a clinical assistant.