Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
A whole-slide foundation model for digital pathology from real-world data
519
Zitationen
28
Autoren
2024
Jahr
Abstract
Digital pathology poses unique computational challenges, as a standard gigapixel slide may comprise tens of thousands of image tiles<sup>1-3</sup>. Prior models have often resorted to subsampling a small portion of tiles for each slide, thus missing the important slide-level context<sup>4</sup>. Here we present Prov-GigaPath, a whole-slide pathology foundation model pretrained on 1.3 billion 256 × 256 pathology image tiles in 171,189 whole slides from Providence, a large US health network comprising 28 cancer centres. The slides originated from more than 30,000 patients covering 31 major tissue types. To pretrain Prov-GigaPath, we propose GigaPath, a novel vision transformer architecture for pretraining gigapixel pathology slides. To scale GigaPath for slide-level learning with tens of thousands of image tiles, GigaPath adapts the newly developed LongNet<sup>5</sup> method to digital pathology. To evaluate Prov-GigaPath, we construct a digital pathology benchmark comprising 9 cancer subtyping tasks and 17 pathomics tasks, using both Providence and TCGA data<sup>6</sup>. With large-scale pretraining and ultra-large-context modelling, Prov-GigaPath attains state-of-the-art performance on 25 out of 26 tasks, with significant improvement over the second-best method on 18 tasks. We further demonstrate the potential of Prov-GigaPath on vision-language pretraining for pathology<sup>7,8</sup> by incorporating the pathology reports. In sum, Prov-GigaPath is an open-weight foundation model that achieves state-of-the-art performance on various digital pathology tasks, demonstrating the importance of real-world data and whole-slide modelling.
Ähnliche Arbeiten
A survey on deep learning in medical image analysis
2017 · 13.595 Zit.
Dermatologist-level classification of skin cancer with deep neural networks
2017 · 13.204 Zit.
A survey on Image Data Augmentation for Deep Learning
2019 · 11.823 Zit.
QuPath: Open source software for digital pathology image analysis
2017 · 8.202 Zit.
Radiomics: Images Are More than Pictures, They Are Data
2015 · 8.026 Zit.
Autoren
- Hanwen Xu
- Naoto Usuyama
- Jaspreet Bagga
- Sheng Zhang
- Rajesh C. Rao
- Tristan Naumann
- Cliff Wong
- Zelalem Gero
- Javier González
- 裕二 池谷
- Yanbo Xu
- Mu Wei
- Wenhui Wang
- Shuming Ma
- Furu Wei
- Jianwei Yang
- Chunyuan Li
- Jianfeng Gao
- Jaylen Rosemon
- Tucker C. Bower
- Soohee Lee
- Roshanthi Weerasinghe
- Bill Wright
- Ari Robicsek
- Brian Piening
- Carlo Bifulco
- Sheng Wang
- Hoifung Poon