Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Medical multimodal large language models: A survey
0
Zitationen
5
Autoren
2026
Jahr
Abstract
• • Provide a detailed analysis of the tasks, architecture, optimization methods, and prompt enhancement techniques of medical MLLMs. • • Offer a comprehensive introduction to the training methods and stages of medical MLLMs, categorizing datasets based on different training phases. • • Discuss two advanced training strategies for activating the reasoning ability of medical MLLMs: supervised reasoning fine-tuning and deep reinforcement learning. • • Summarize the key challenges currently faced by medical MLLMs and propose potential solutions. In recent years, multimodal large language models (MLLMs) have gradually given rise to medical multimodal large language models (Medical MLLMs) through the integration of multimodal data such as clinical reports, medical images, physiological signals, and doctor–patient conversations. This progress has substantially improved the efficiency and quality of clinical question answering. Given the rapid development of this field and its broad clinical potential, this survey presents a systematic review of the core tasks, fundamental principles, methodological innovations, and future research directions of Medical MLLMs. Specifically, this survey first outlines the core medical tasks addressed by Medical MLLMs. It then dissects the three key modules of Medical MLLMs and, through a fine-grained vector-level analysis, explains how input feature vectors are processed and propagated across these modules. Subsequently, existing medical datasets are systematically categorized according to different stages of model training to help researchers identify relevant resources more efficiently. Furthermore, this survey elaborates on the training and evaluation methods of Medical MLLMs and discusses advanced strategies for activating their reasoning capabilities. Finally, it summarizes their practical applications in medical scenarios and highlights the critical challenges and future directions. This survey aims to provide a comprehensive reference for researchers in the field of Medical MLLMs and to promote the adaptation of Medical MLLMs to increasingly complex and diverse medical tasks.
Ähnliche Arbeiten
"Why Should I Trust You?"
2016 · 14.794 Zit.
Coding Algorithms for Defining Comorbidities in ICD-9-CM and ICD-10 Administrative Data
2005 · 10.558 Zit.
A Comprehensive Survey on Graph Neural Networks
2020 · 8.990 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.602 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.127 Zit.