Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Testing and evaluation of generative large language models in electronic health record applications: a systematic review
3
Zitationen
14
Autoren
2025
Jahr
Abstract
Our findings highlight the need to evaluate generative LLMs on EHR data across a broader range of clinical specialties and tasks, as well as the urgent need for standardized, scalable, and clinically meaningful evaluation frameworks.
Ähnliche Arbeiten
"Why Should I Trust You?"
2016 · 14.281 Zit.
A Comprehensive Survey on Graph Neural Networks
2020 · 8.646 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.169 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.564 Zit.
Artificial intelligence in healthcare: past, present and future
2017 · 4.399 Zit.