Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Artificial Intelligence in Craniosynostosis Surgery: A Comparison of Large Language Models in Answering Perioperative Care Questions
0
Zitationen
7
Autoren
2026
Jahr
Abstract
BACKGROUND: Craniosynostosis surgery poses complex challenges for caregivers. Often, they utilize large language models (LLMs) for preoperative and postoperative information. Although LLMs offer accessible guidance, persistent concerns center on their information quality and readability, especially in specialized surgical contexts. METHODS: This study evaluates the readability and quality of responses from 4 leading LLMs, ChatGPT-4o, Google Gemini 2.0, DeepSeek, and OpenEvidence, to 20 standardized perioperative questions about craniosynostosis surgery. Quality was assessed using modified DISCERN criteria, and readability was measured using SMOG Index. Statistical analysis included 1-way ANOVA and Tukey HSD. RESULTS: By SMOG score, OpenEvidence produced responses at the highest reading level (17.54), indicating a graduate-level comprehension requirement. ChatGPT (14.45), DeepSeek (14.40), and Google Gemini (15.17) generated information at an undergraduate reading level. By information quality, measured by modified DISCERN scores, Google Gemini achieved the highest score (42.95, P <0.001) out of maximum 45, significantly outperforming ChatGPT (36.25), DeepSeek (37.55), and OpenEvidence (36.75). Gemini's responses were rated highest in clarity, citation use, and support for shared decision-making. CONCLUSIONS: LLMs vary significantly in readability and information quality. Google Gemini offered the most trustworthy content, whereas DeepSeek was most accessible. No single model excelled across all dimensions, suggesting that clinicians should guide caregivers toward LLMs best suited to their literacy level. Generative AI holds promise for augmenting patient education in craniosynostosis care. However, it should be used alongside clinician input to ensure clarity, accuracy, and relevance.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.740 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.649 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.202 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.886 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.