Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
A multimodal generative AI copilot for human pathology
307
Zitationen
20
Autoren
2024
Jahr
Abstract
Computational pathology<sup>1,2</sup> has witnessed considerable progress in the development of both task-specific predictive models and task-agnostic self-supervised vision encoders<sup>3,4</sup>. However, despite the explosive growth of generative artificial intelligence (AI), there have been few studies on building general-purpose multimodal AI assistants and copilots<sup>5</sup> tailored to pathology. Here we present PathChat, a vision-language generalist AI assistant for human pathology. We built PathChat by adapting a foundational vision encoder for pathology, combining it with a pretrained large language model and fine-tuning the whole system on over 456,000 diverse visual-language instructions consisting of 999,202 question and answer turns. We compare PathChat with several multimodal vision-language AI assistants and GPT-4V, which powers the commercially available multimodal general-purpose AI assistant ChatGPT-4 (ref. <sup>6</sup>). PathChat achieved state-of-the-art performance on multiple-choice diagnostic questions from cases with diverse tissue origins and disease models. Furthermore, using open-ended questions and human expert evaluation, we found that overall PathChat produced more accurate and pathologist-preferable responses to diverse queries related to pathology. As an interactive vision-language AI copilot that can flexibly handle both visual and natural language inputs, PathChat may potentially find impactful applications in pathology education, research and human-in-the-loop clinical decision-making.
Ähnliche Arbeiten
A survey on deep learning in medical image analysis
2017 · 13.616 Zit.
Dermatologist-level classification of skin cancer with deep neural networks
2017 · 13.220 Zit.
A survey on Image Data Augmentation for Deep Learning
2019 · 11.837 Zit.
QuPath: Open source software for digital pathology image analysis
2017 · 8.207 Zit.
Radiomics: Images Are More than Pictures, They Are Data
2015 · 8.030 Zit.
Autoren
Institutionen
- Brigham and Women's Hospital(US)
- Massachusetts Institute of Technology(US)
- Broad Institute(US)
- Massachusetts General Hospital(US)
- Harvard University(US)
- The Ohio State University Wexner Medical Center(US)
- Pusan National University(KR)
- Mayo Clinic in Arizona(US)
- Harvard–MIT Division of Health Sciences and Technology(US)