Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Large language models versus classical machine learning performance in COVID-19 mortality prediction using high-dimensional tabular data
1
Zitationen
42
Autoren
2025
Jahr
Abstract
This study compared the performance of classical feature-based machine learning models (CMLs) and large language models (LLMs) in predicting COVID-19 mortality using high-dimensional tabular data from 9,134 patients across four hospitals. Seven CML models, including XGBoost and random forest (RF), were evaluated alongside eight LLMs, such as GPT-4 and Mistral-7b, which performed zero-shot classification on text-converted structured data. Additionally, Mistral-7b was fine-tuned using the QLoRA approach. XGBoost and RF demonstrated superior performance among CMLs, achieving F1 scores of 0.87 and 0.83 for internal and external validation, respectively. GPT-4 led the LLM category with an F1 score of 0.43, while fine-tuning Mistral-7b significantly improved its recall from 1% to 79%, yielding a stable F1 score of 0.74 during external validation. Although LLMs showed moderate performance in zero-shot classification, fine-tuning substantially enhanced their effectiveness, potentially bridging the gap with CML models. However, CMLs still outperformed LLMs in handling high-dimensional tabular data tasks. This study highlights the potential of both CMLs and fine-tuned LLMs in medical predictive modeling, while emphasizing the current superiority of CMLs for structured data analysis.
Ähnliche Arbeiten
Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study
2020 · 22.617 Zit.
La certeza de lo impredecible: Cultura Educación y Sociedad en tiempos de COVID19
2020 · 19.271 Zit.
A Multi-Modal Distributed Real-Time IoT System for Urban Traffic Control (Invited Paper)
2024 · 14.265 Zit.
UNet++: A Nested U-Net Architecture for Medical Image Segmentation
2018 · 8.562 Zit.
Review of deep learning: concepts, CNN architectures, challenges, applications, future directions
2021 · 7.176 Zit.
Autoren
- Mohammadreza Ghaffarzadeh-Esfahani
- Mahdi Ghaffarzadeh-Esfahani
- Aryan Salahi‐Niri
- Hossein Toreyhi
- Zahra Atf
- Amirali Mohsenzadeh-Kermani
- Mahshad Sarikhani
- Zohreh Tajabadi
- Fatemeh Shojaeian
- Mohammad Hassan Bagheri
- Aydin Feyzi
- Mohamadamin Tarighat-Payma
- Narges Gazmeh
- Fatemeh Heydari
- Hossein Afshar
- Amirreza Allahgholipour
- Farid Alimardani
- Ameneh Salehi
- Naghmeh Asadimanesh
- Mohammad Khalafi
- Hadis Shabanipour
- Ali Moradi
- Sajjad Hossein Zadeh
- Omid Yazdani
- Romina Esbati
- Moozhan Maleki
- Danial Samiei Nasr
- Amirali Soheili
- Hossein Majlesi
- Saba Shahsavan
- Alireza Soheilipour
- Nooshin Goudarzi
- Erfan Taherifard
- Hamidreza Hatamabadi
- Jamil S. Samaan
- Thomas Savage
- Ankit Sakhuja
- Ali Soroush
- Girish N. Nadkarni
- Ilad Alavi Darazam
- Mohamad Amin Pourhoseingholi
- Seyed Amir Ahmad Safavi‐Naini
Institutionen
- Isfahan University of Medical Sciences(IR)
- Shahid Beheshti University of Medical Sciences(IR)
- Ontario Tech University(CA)
- Tehran University of Medical Sciences(IR)
- Johns Hopkins University(US)
- Iranshahr University(IR)
- Shiraz University of Medical Sciences(IR)
- Imam Hossein Hospital(IR)
- Cedars-Sinai Medical Center(US)
- Stanford University(US)
- Icahn School of Medicine at Mount Sinai(US)
- Research Institute for Endocrine Sciences(IR)
- NIHR Nottingham Hearing Biomedical Research Unit(GB)
- Nottingham Biomedical Research Centre(GB)
- Child Health and Development Institute(US)