Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Assessing the quality of AI-generated and physician-written discharge summaries: evaluation of an EHR-integrated tool in a Dutch academic hospital
0
Zitationen
67
Autoren
2026
Jahr
Abstract
BACKGROUND: Large language models (LLMs) offer potential to reduce administrative burden in clinical care by generating discharge summaries. Most prior evaluations have been limited to drafts, small cohorts, or non-integrated settings. Robust validation of fully automated, EHR-integrated systems in real-world practice is lacking. METHODS: This study was conducted in April 2025 at a Dutch academic hospital. A total of 292 paired discharge summaries from multiple departments were evaluated, each consisting of a physician-written and an LLM-generated version. Summaries were independently assessed by eight blinded clinicians using a 5-point Likert scale across completeness, correctness, and conciseness. Trustworthiness was also scored. Domain and total scores were compared with Wilcoxon signed-rank tests, and interrater reliability was quantified using Gwet's AC2. FINDINGS: LLM-generated summaries had lower completeness (4.50 (4.00-5.00) vs 5.00 (4.50-5.00); p < 0.001), similar correctness (5.00 (4.50-5.00) vs 5.00 (4.63-5.00); p = 0.14), and greater conciseness (5.00 (4.50-5.00) vs 4.50 (4.00-5.00); p < 0.001) compared with physician-written summaries. Total scores did not differ (14.00 (13.00-15.00) vs 14.00 (13.00-15.00); p = 0.34). Physician-written summaries were trusted by both reviewers in 279 (95.5%) cases, whereas LLM-generated summaries were trusted in 249 (85.3%) cases, partially trusted in 34 (11.6%), and rejected in 9 (3.1%). Interrater agreement for total scores was high (AC2 0.87, 95% CI 0.83-0.90 for LLM; 0.85, 95% CI 0.81-0.89 for physician). INTERPRETATION: Discharge summaries generated by an EHR-integrated LLM achieved quality ratings comparable to physician-written documents across multiple specialties, with no difference in total scores. Unlike earlier pilot work, this study demonstrates real-world feasibility of automated LLM use in clinical workflows at scale. With appropriate oversight and specialty-specific refinement, such systems could substantially reduce documentation burden while maintaining discharge summary quality. FUNDING: This research did not receive a specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.693 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.598 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.124 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.871 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.
Autoren
- Tarannom Mehri
- Tun Nadalini
- Anne H. Hoekman
- Tom P. van der Laan
- Katerina Kagialari
- Robert K. Wagner
- Job N. Doornberg
- Rosanne C. Schoonbeek
- Charlotte MHHT Bootsma-Robroeks
- M. Aalderink
- R. Van den Berg
- M.T.P. Besouw
- A.V. Biere
- FAJA Bodewes
- A.L. Boerboom
- M.A.J. Borgdorff
- MH de Borst
- M. Bouhuys
- B.R. Brandsema
- G.H. Bultema
- M.J. Crop
- H.P.J. Van der Doef
- J.W.J. Donkers
- J.M. Douwes
- R.A. Feijen
- F. Fontanella
- B. Foreman
- V. Gracchi
- I. De Groot
- G.B. Halmos
- A.A. Van Heerwaarde
- F. Van den Heuvel
- C. Holzhauer
- FFA IJpma
- E. Kersten
- R.J.H. Knoef
- M.C.A. Kramer
- S. Krishnapillai
- M. Labberté
- J.M. Lammers
- L.B. De Langen
- E. Lensen
- WS Lexmond
- E.T. Liem
- E. Loeffen
- J. Lorius
- C. Lubout
- J. Ludwig-Roukema
- S. Luiten
- D. Meijering
- C. Out
- S. Palthe
- M.T.R. Roofthooft
- R. Scheenstra
- R.S.B.H. Schreuder
- M.L. Schrijvers
- P.F. Sinnige
- W.J. Van Veen
- C.A. Te Velde-Keyzer
- KT Verbruggen
- M. Verheijen
- F.P.J. Vernimmen
- J. A. de Vries
- W. De Weerd
- C.L. Welsink
- J.E.J. Woolderink
- A.T. Zwart