Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
A Standardized Validation Framework for Clinically Actionable Healthcare Machine Learning with Knee Osteoarthritis Grading as a Case Study
6
Zitationen
4
Autoren
2025
Jahr
Abstract
Background: High in-domain accuracy in healthcare machine learning (ML) models does not guarantee reliable clinical performance, especially when training and validation protocols are insufficiently robust. This paper presents a standardized framework for training and validating ML models intended for classifying medical conditions, emphasizing the need for clinically relevant evaluation metrics and external validation. Methods: We apply this framework to a case study in knee osteoarthritis grading, demonstrating how overfitting, data leakage, and inadequate validation can lead to deceptively high accuracy that fails to translate into clinical reliability. In addition to conventional metrics, we introduce composite clinical measures that better capture real-world utility. Results: Our findings show that models with strong in-domain performance may underperform on external datasets, and that composite metrics provide a more nuanced assessment of clinical applicability. Conclusions: Standardized training and validation protocols, together with clinically oriented evaluation, are essential for developing ML models that are both statistically robust and clinically reliable across a range of medical classification tasks.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.316 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.177 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.575 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.468 Zit.