OpenAlex · Aktualisierung stündlich · Letzte Aktualisierung: 09.05.2026, 03:55

Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

Association Between Voluntary Engagement with LLM-Powered Virtual Standardized Patients and Medical Students’ Clinical Interview Performance: Retrospective Observational Study (Preprint)

2026·0 Zitationen
Volltext beim Verlag öffnen

0

Zitationen

7

Autoren

2026

Jahr

Abstract

<sec> <title>BACKGROUND</title> Virtual Standardized Patients (VSPs) powered by large language models (LLMs) are emerging as scalable tools to supplement human Standardized Patient (SP) training in medical education. However, their added value in settings with established SP programs remains underexplored. </sec> <sec> <title>OBJECTIVE</title> To assess whether voluntary engagement with an LLM-powered VSP is associated with medical students’ SP interview scores within established in-person SP programs. </sec> <sec> <title>METHODS</title> We analyzed VSP usage logs and SP assessment data from fourth-year undergraduate medical students enrolled in a 7-week diagnostics course with weekly in-person SP training sessions. Students had voluntary access to a LLM-powered VSP system. VSP usage was quantified by the number of valid sessions and the total number of question–answer (QA) pairs. The primary outcome was the composite SP interview score (0–100). Spearman and Pearson correlations, linear regression, and independent t tests were conducted. </sec> <sec> <title>RESULTS</title> A total of 92 students generated 359 valid VSP dialogues comprising 19,380 QA pairs; the overall hallucination rate was 0.34%. Median VSP usage was 2 sessions (IQR 1-7, range 0-18). Students produced a median of 132 QA pairs (IQR 38-323; range 0-916). The mean SP interview score was 92.8 ± 2.3. VSP session frequency showed a weak but statistically significant positive correlation with SP scores (Spearman ρ = 0.25, P = .016). Linear regression indicated that each additional session was associated with a 0.15 point increase in SP score (β = 0.15; 95% CI: 0.040 - 0.26; P = .011; R² = 0.070). A threshold of 7 sessions distinguished a high-use group (≥ 7 sessions, n = 22) that achieved higher scores than the low-use group (&lt; 7 sessions, n = 70) (94.1 ± 2.0 vs. 92.5 ± 2.4; P = .013). Total QA pairs showed a moderate positive correlation with SP scores (Spearman ρ = 0.28, P = .006). Regression revealed that each additional QA pair was associated with a 0.0033 point increase in SP score (β = 0.0033; 95% CI: 0.0011 - 0.0055; P = .002; R² = 0.106). Students in the high-QA group (≥ 132 pairs, n = 46) scored higher than those in the low-QA group (&lt; 132 pairs, n = 46) (93.3 ± 2.3 vs. 92.3 ± 2.3; P = .030). </sec> <sec> <title>CONCLUSIONS</title> Within a human SP curriculum, autonomous engagement with an LLM-powered VSP was associated with small but meaningful improvements in clinical interview performance. VSPs appear to function as scalable “cognitive replay” environments that provide low-stakes, feedback-rich practice. Educational strategies should move beyond counting VSP exposures toward fostering deeper, reflective engagement with AI-based simulations. </sec>

Ähnliche Arbeiten

Autoren

Themen

Artificial Intelligence in Healthcare and EducationSimulation-Based Education in HealthcareDigital Mental Health Interventions
Volltext beim Verlag öffnen