Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Large language models versus classical machine learning performance in COVID-19 mortality prediction using high-dimensional tabular data
1
Zitationen
42
Autoren
2025
Jahr
Abstract
This study compared the performance of classical feature-based machine learning models (CMLs) and large language models (LLMs) in predicting COVID-19 mortality using high-dimensional tabular data from 9,134 patients across four hospitals. Seven CML models, including XGBoost and random forest (RF), were evaluated alongside eight LLMs, such as GPT-4 and Mistral-7b, which performed zero-shot classification on text-converted structured data. Additionally, Mistral-7b was fine-tuned using the QLoRA approach. XGBoost and RF demonstrated superior performance among CMLs, achieving F1 scores of 0.87 and 0.83 for internal and external validation, respectively. GPT-4 led the LLM category with an F1 score of 0.43, while fine-tuning Mistral-7b significantly improved its recall from 1% to 79%, yielding a stable F1 score of 0.74 during external validation. Although LLMs showed moderate performance in zero-shot classification, fine-tuning substantially enhanced their effectiveness, potentially bridging the gap with CML models. However, CMLs still outperformed LLMs in handling high-dimensional tabular data tasks. This study highlights the potential of both CMLs and fine-tuned LLMs in medical predictive modeling, while emphasizing the current superiority of CMLs for structured data analysis.
Ähnliche Arbeiten
La certeza de lo impredecible: Cultura Educación y Sociedad en tiempos de COVID19
2020 · 19.307 Zit.
A Multi-Modal Distributed Real-Time IoT System for Urban Traffic Control (Invited Paper)
2024 · 14.302 Zit.
UNet++: A Nested U-Net Architecture for Medical Image Segmentation
2018 · 8.797 Zit.
Review of deep learning: concepts, CNN architectures, challenges, applications, future directions
2021 · 7.407 Zit.
scikit-image: image processing in Python
2014 · 6.841 Zit.
Autoren
- Mohammadreza Ghaffarzadeh-Esfahani
- Mahdi Ghaffarzadeh-Esfahani
- Aryan Salahi‐Niri
- Hossein Toreyhi
- Zahra Atf
- Amirali Mohsenzadeh-Kermani
- Mahshad Sarikhani
- Zohreh Tajabadi
- Fatemeh Shojaeian
- Mohammad Hassan Bagheri
- Aydin Feyzi
- Mohamadamin Tarighat-Payma
- Narges Gazmeh
- Fatemeh Heydari
- Hossein Afshar
- Amirreza Allahgholipour
- Farid Alimardani
- Ameneh Salehi
- Naghmeh Asadimanesh
- Mohammad Khalafi
- Hadis Shabanipour
- Ali Moradi
- Sajjad Hossein Zadeh
- Omid Yazdani
- Romina Esbati
- Moozhan Maleki
- Danial Samiei Nasr
- Amirali Soheili
- Hossein Majlesi
- Saba Shahsavan
- Alireza Soheilipour
- Nooshin Goudarzi
- Erfan Taherifard
- Hamidreza Hatamabadi
- Jamil S. Samaan
- Thomas Savage
- Ankit Sakhuja
- Ali Soroush
- Girish N. Nadkarni
- Ilad Alavi Darazam
- Mohamad Amin Pourhoseingholi
- Seyed Amir Ahmad Safavi‐Naini
Institutionen
- Isfahan University of Medical Sciences(IR)
- Shahid Beheshti University of Medical Sciences(IR)
- Ontario Tech University(CA)
- Tehran University of Medical Sciences(IR)
- Johns Hopkins University(US)
- Iranshahr University(IR)
- Shiraz University of Medical Sciences(IR)
- Imam Hossein Hospital(IR)
- Cedars-Sinai Medical Center(US)
- Stanford University(US)
- Icahn School of Medicine at Mount Sinai(US)
- Research Institute for Endocrine Sciences(IR)
- NIHR Nottingham Hearing Biomedical Research Unit(GB)
- Nottingham Biomedical Research Centre(GB)
- Child Health and Development Institute(US)