Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
AI at the Helm: Evaluating Claude 3.5 Sonet and ChatGPT-4.0 in Tympanoplasty Management
2
Zitationen
2
Autoren
2026
Jahr
Abstract
BACKGROUND: Artificial intelligence (AI) is increasingly being integrated into health care, offering new possibilities for postoperative management. Large language models (LLMs) like ChatGPT-4 and Claude 3.5 Sonet have demonstrated potential in patient education and clinical support. This study evaluates their effectiveness in providing postoperative guidance following tympanoplasty, focusing on accuracy, clarity, and relevance. METHODS: Fifteen frequently asked postoperative tympanoplasty questions were compiled from 50 patients and validated by 15 otolaryngologists-ChatGPT-4 and Claude 3.5 Sonet generated responses under standardized conditions. AI-generated responses were assessed by the expert panel using a 5-point Likert scale for accuracy, response time, clarity, and relevance. Advanced statistical analysis was conducted to compare the models' performance, including Cohen kappa for inter-rater reliability, effect size calculations, and P -value analysis. RESULTS: Claude 3.5 Sonet consistently outperformed ChatGPT-4 across all evaluated parameters. It demonstrated superior accuracy, faster response times, improved clarity, and higher relevance in patient education ( P <0.001). Statistical analysis confirmed significant differences, with Claude achieving stronger inter-rater reliability and response consistency. CONCLUSION: Claude 3.5 Sonet demonstrated a notable advantage over ChatGPT-4 in providing structured and clinically accurate postoperative tympanoplasty guidance. These findings suggest that AI-driven conversational agents can enhance patient education and support postoperative care. However, further research is necessary to refine AI-based tools and evaluate their broader applicability in clinical practice. LEVEL OF EVIDENCE: Level III.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.693 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.598 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.124 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.871 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.