Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Surgeon, Trainee, or <scp>GPT</scp> ? A Blinded Multicentric Study of <scp>AI</scp> ‐Augmented Operative Notes
5
Zitationen
15
Autoren
2025
Jahr
Abstract
ABSTRACT Objectives Clear, complete operative documentation is essential for surgical safety, continuity of care, and medico‐legal standards. Large language models such as ChatGPT offer promise for automating clinical documentation; however, their performance in operative note generation, particularly in surgical subspecialties, remains underexplored. This study aimed to compare the quality, accuracy, and efficiency of operative notes authored by a surgical resident, attending surgeon, GPT alone, and an attending surgeon using GPT as a writing aid. Methods Five publicly available otolaryngologic procedures were selected. For each procedure, four operative notes were generated, one by a resident, one by an attending, one by GPT alone, and one by a hybrid of attending plus GPT. Ten blinded otolaryngologists (five residents, five attendings) independently reviewed all 20 notes. Reviewers scored each note across eight domains using a five‐point scale, assigned a final approval rating, and provided qualitative feedback. Writing time was recorded to assess documentation efficiency. Results Hybrid notes written by an attending surgeon with GPT assistance received the highest average domain scores and the highest “as is” approval rate (79%), outperforming all other groups. GPT‐only notes were the fastest to generate but had the lowest approval rate (23%) and the highest incidence of both omissions and overdocumentation. Writing time was significantly reduced in both AI‐assisted groups compared to human‐only authorship. Inter‐rater reliability among reviewers was moderate to high across most domains. Conclusion In this limited dataset, hybrid human–AI collaboration outperformed both human‐only and AI‐only authorship in operative documentation. These findings support GPT‐assisted documentation to improve operative note efficiency and consistency. Level of Evidence N/A.
Ähnliche Arbeiten
The SCARE 2018 statement: Updating consensus Surgical CAse REport (SCARE) guidelines
2018 · 2.335 Zit.
The Levels of Evidence and Their Role in Evidence-Based Medicine
2011 · 2.121 Zit.
CARE guidelines for case reports: explanation and elaboration document
2017 · 2.035 Zit.
The SCARE Statement: Consensus-based surgical case report guidelines
2016 · 1.681 Zit.
Dermoscopy of pigmented skin lesions: Results of a consensus meeting via the Internet
2003 · 1.201 Zit.
Autoren
Institutionen
- Sheba Medical Center(IL)
- AOL (United States)(US)
- Ospedale San Paolo(IT)
- Royal Hallamshire Hospital(GB)
- Sheffield Teaching Hospitals NHS Foundation Trust(GB)
- Università degli Studi di Enna Kore(IT)
- University of Catania(IT)
- University of Antwerp(BE)
- Antwerp University Hospital(BE)
- Hospital General Universitario De Valencia(ES)
- Biogipuzkoa Health Research Institute(ES)
- Osakidetza(ES)
- Children's National(US)