Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
AI-Generated Clinical Summaries: Errors and Susceptibility to Speech and Speaker Variability
0
Zitationen
9
Autoren
2025
Jahr
Abstract
OBJECTIVES: To evaluate whether variability in patients' communication style (personality), international English-accents (human and synthetic) and speech impairments affects the accuracy of a Clinical AI Scribe (CAIS) and identify where performance degrades to inform pre-deployment validation and monitoring. METHODS: We conducted simulated primary-care consultations using trained actors. For personality types, four scenarios were enacted, each with five patient-personality types. For accents, transcripts of consultations were used to generate combinations of seven accents across five scenarios. The CAIS produced summaries that were compared with transcripts, and errors classified as omissions, factual inaccuracies or hallucinations. For speech impairments, public recordings representing five profiles were transcribed and word-recognition accuracy calculated. RESULTS: Personality types showed no statistically significant differences in errors (all p>0.05). Extraversion had the highest total errors (median 3.5). Across accents, comparisons were non-significant for both patient and doctor voices (patients: p=0.851; doctors: p=0.980). Omissions predominated, with low rates of hallucinations and factual inaccuracies. Omissions were slightly higher for Chinese-accented and Indian-accented doctors (both medians 3.0). Conversely, speech impairments differed: cleft palate and vowel disorders were near-perfect, whereas phonological impairment markedly reduced recognition (p<0.001). DISCUSSION: Operationally, CAIS deployment should include clinician-in-the-loop verification, subgroup performance monitoring (accents, impairments) and predefined 'switch-off' criteria for severe phonological patterns. High-quality synthetic voices are a pragmatic proxy for accent testing when balanced corpora are unavailable. CONCLUSIONS: Under controlled conditions CAIS performance was broadly stable across communication styles and most accents, but vulnerable to specific speech characteristics, particularly phonological impairment, in this single-system simulation study.
Ähnliche Arbeiten
Concurrent Chemotherapy and Radiotherapy for Organ Preservation in Advanced Laryngeal Cancer
2003 · 3.165 Zit.
Induction Chemotherapy plus Radiation Compared with Surgery plus Radiation in Patients with Advanced Laryngeal Cancer
1991 · 2.736 Zit.
Global, regional, and national burden of Parkinson's disease, 1990–2016: a systematic analysis for the Global Burden of Disease Study 2016
2018 · 2.732 Zit.
The Voice Handicap Index (VHI)
1997 · 2.510 Zit.
Reliability and Factor Analysis of the Epworth Sleepiness Scale
1992 · 2.131 Zit.