Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
The Capabilities of Large Language Models in Extracting Unstructured Data From Histopathology Reports
1
Zitationen
6
Autoren
2025
Jahr
Abstract
The gold standard for medical data extraction has traditionally been manual; however, this approach is very time consuming, labour intensive, expensive and prone to error. Many approaches to automated data extraction have been explored over the years; however, they have required significant technical knowledge and have not been reliably accurate. Large Language Models (LLMs) have been developed at an astronomical pace and have demonstrated incredible accuracy, and they are continuing to evolve and improve. The significant time and cost savings achieved with LLMs will allow for more efficient research and real-time monitoring of patient outcomes. This review explores how LLMs can be used in medical data collection, including the types of data collected and output given, types of LLMs used, amount of training required, the accuracy, speed, and cost of data extraction, types of errors commonly made, and any security concerns. There are still many challenges to overcome, particularly with reducing hallucinations/fictitious content and other common errors, ensuring patient privacy, handling complex tasks and producing output in clean and usable formats. Health professionals and researchers in the field of dermatology must be well trained and upskilled in the use of these new technologies and should continue to explore and build on what has already been achieved to optimise the use of LLMs in the automated data extraction process.
Ähnliche Arbeiten
A survey on deep learning in medical image analysis
2017 · 13.877 Zit.
pROC: an open-source package for R and S+ to analyze and compare ROC curves
2011 · 13.749 Zit.
Dermatologist-level classification of skin cancer with deep neural networks
2017 · 13.437 Zit.
A survey on Image Data Augmentation for Deep Learning
2019 · 12.029 Zit.
QuPath: Open source software for digital pathology image analysis
2017 · 8.377 Zit.