Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Artificial Intelligence in Cancer Research: Modality Dependence and Limited Visual–Spatial Integration in Multimodal Large Language Models for Breast Cancer Histopathology
0
Zitationen
5
Autoren
2026
Jahr
Abstract
Multimodal large language models (MLLMs) are increasingly considered for cancer diagnostic support, yet their suitability for histopathological image interpretation remains inadequately characterized. We evaluated six contemporary general-purpose MLLMs (Claude Opus 4.6, Claude Sonnet 4.6, Claude Haiku 4.5, ChatGPT 5.3, Grok 4.2, Gemini 3.1 Pro) on 58 paired hematoxylin and eosin (H&E)-stained breast cancer histopathology images (26 malignant, 32 benign) and corresponding nuclei segmentation masks. Each case was classified five times per model under three conditions, image only (IMAGE), mask only (MASK), and both combined (BOTH), yielding 5220 observations. Mean accuracy dropped from 69.4% (IMAGE) to 49.6% (MASK), below the majority-class baseline of 55.2%. Providing the mask together with the image did not improve classification (68.0%), and for ChatGPT 5.3 produced a net loss of 31 correct predictions. Models maintained elevated mean confidence (67.6) under MASK despite near-random accuracy, and reasoning categories shifted in 67.5% of matched case–run pairs between modalities. Under the conditions tested, current general-purpose MLLMs exhibit strong dependence on visual surface features, fail to effectively integrate spatial structural information, and maintain confidence independent of accuracy. These behavioral limitations are directly relevant to the safe deployment of MLLMs in cancer diagnostic workflows.
Ähnliche Arbeiten
A survey on deep learning in medical image analysis
2017 · 14.008 Zit.
pROC: an open-source package for R and S+ to analyze and compare ROC curves
2011 · 13.802 Zit.
Dermatologist-level classification of skin cancer with deep neural networks
2017 · 13.525 Zit.
A survey on Image Data Augmentation for Deep Learning
2019 · 12.145 Zit.
QuPath: Open source software for digital pathology image analysis
2017 · 8.432 Zit.
Autoren
Institutionen
- Friedrich-Alexander-Universität Erlangen-Nürnberg(DE)
- Otto-von-Guericke-Universität Magdeburg(DE)
- Gemeinschaftskrankenhaus Havelhöhe(DE)
- RWTH Aachen University(DE)
- Heidelberg University(DE)
- University Hospital Heidelberg(DE)
- Diabetesinstitut Heidelberg(DE)
- Berlin Center for Epidemiology and Health Research(DE)