This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
A framework for evaluating clinical artificial intelligence systems without ground-truth annotations
Citations: 17
Authors: 4
Year: 2024
Abstract
A clinical artificial intelligence (AI) system is often validated on data withheld during its development. This provides an estimate of its performance upon future deployment on data in the wild: data that are currently unseen but are expected to be encountered in a clinical setting. However, estimating performance on data in the wild is complicated by distribution shift between data in the wild and withheld data, and by the absence of ground-truth annotations. Here, we introduce SUDO, a framework for evaluating AI systems on data in the wild. Through experiments on AI systems developed for dermatology images, histopathology patches, and clinical notes, we show that SUDO can identify unreliable predictions, inform the selection of models, and allow for the previously out-of-reach assessment of algorithmic bias for data in the wild without ground-truth annotations. These capabilities can contribute to the deployment of trustworthy and ethical AI systems in medicine.
Similar works
A survey on deep learning in medical image analysis
2017 · 14,019 citations
pROC: an open-source package for R and S+ to analyze and compare ROC curves
2011 · 13,808 citations
Dermatologist-level classification of skin cancer with deep neural networks
2017 · 13,528 citations
A survey on Image Data Augmentation for Deep Learning
2019 · 12,149 citations
QuPath: Open source software for digital pathology image analysis
2017 · 8,437 citations