Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
The potential of Generative Pre-trained Transformer 4 (GPT-4) to analyse medical notes in three different languages: a retrospective model-evaluation study
34
Zitationen
39
Autoren
2024
Jahr
Abstract
BACKGROUND: Patient notes contain substantial information but are difficult for computers to analyse due to their unstructured format. Large-language models (LLMs), such as Generative Pre-trained Transformer 4 (GPT-4), have changed our ability to process text, but we do not know how effectively they handle medical notes. We aimed to assess the ability of GPT-4 to answer predefined questions after reading medical notes in three different languages. METHODS: For this retrospective model-evaluation study, we included eight university hospitals from four countries (ie, the USA, Colombia, Singapore, and Italy). Each site submitted seven de-identified medical notes related to seven separate patients to the coordinating centre between June 1, 2023, and Feb 28, 2024. Medical notes were written between Feb 1, 2020, and June 1, 2023. One site provided medical notes in Spanish, one site provided notes in Italian, and the remaining six sites provided notes in English. We included admission notes, progress notes, and consultation notes. No discharge summaries were included in this study. We advised participating sites to choose medical notes that, at time of hospital admission, were for patients who were male or female, aged 18-65 years, had a diagnosis of obesity, had a diagnosis of COVID-19, and had submitted an admission note. Adherence to these criteria was optional and participating sites randomly chose which medical notes to submit. When entering information into GPT-4, we prepended each medical note with an instruction prompt and a list of 14 questions that had been chosen a priori. Each medical note was individually given to GPT-4 in its original language and in separate sessions; the questions were always given in English. At each site, two physicians independently validated responses by GPT-4 and responded to all 14 questions. Each pair of physicians evaluated responses from GPT-4 to the seven medical notes from their own site only. Physicians were not masked to responses from GPT-4 before providing their own answers, but were masked to responses from the other physician. FINDINGS: We collected 56 medical notes, of which 42 (75%) were in English, seven (13%) were in Italian, and seven (13%) were in Spanish. For each medical note, GPT-4 responded to 14 questions, resulting in 784 responses. In 622 (79%, 95% CI 76-82) of 784 responses, both physicians agreed with GPT-4. In 82 (11%, 8-13) responses, only one physician agreed with GPT-4. In the remaining 80 (10%, 8-13) responses, neither physician agreed with GPT-4. Both physicians agreed with GPT-4 more often for medical notes written in Spanish (86 [88%, 95% CI 79-93] of 98 responses) and Italian (82 [84%, 75-90] of 98 responses) than in English (454 [77%, 74-80] of 588 responses). INTERPRETATION: The results of our model-evaluation study suggest that GPT-4 is accurate when analysing medical notes in three different languages. In the future, research should explore how LLMs can be integrated into clinical workflows to maximise their use in health care. FUNDING: None.
Ähnliche Arbeiten
Machine Learning in Medicine
2019 · 3.818 Zit.
Systematic Review: Impact of Health Information Technology on Quality, Efficiency, and Costs of Medical Care
2006 · 3.176 Zit.
Effects of Computerized Clinical Decision Support Systems on Practitioner Performance and Patient Outcomes
2005 · 2.972 Zit.
Studies in health technology and informatics
2008 · 2.903 Zit.
An overview of clinical decision support systems: benefits, risks, and strategies for success
2020 · 2.743 Zit.
Autoren
- Maria Clara Saad Menezes
- Alexander Hoffmann
- Amelia L.M. Tan
- Mariné Nalbandyan
- Gilbert S. Omenn
- Diego R. Mazzotti
- Alejandro Hernández-Arango
- Shyam Visweswaran
- Shruthi Venkatesh
- Kenneth D. Mandl
- Florence T. Bourgeois
- James W K Lee
- Andrew Makmur
- David A. Hanauer
- Michael G. Semanik
- Lauren T. Kerivan
- Terra Hill
- Julian Forero
- Carlos Alberto Velásquez Restrepo
- Matteo Vigna
- Piero Ceriana
- Noor Abu-el-rub
- Paul Avillach
- Riccardo Bellazzi
- Thomas Callaci
- Alba Gutiérrez‐Sacristán
- Alberto Malovini
- Jomol Mathew
- Michele Morris
- Venkatesh L. Murthy
- Tommaso Mario Buonocore
- Enea Parimbelli
- Lav P. Patel
- Carlos Sáez
- Malarkodi Jebathilagam Samayamuthu
- Jeffrey Thompson
- Valentina Tibollo
- Zongqi Xia
- Isaac S. Kohane
Institutionen
- The University of Texas Southwestern Medical Center(US)
- Harvard University(US)
- University of Wisconsin–Madison(US)
- Wisconsin Division of Public Health(US)
- University of Michigan(US)
- University of Kansas Medical Center(US)
- Universidad de Antioquia(CO)
- University of Pittsburgh(US)
- Boston Children's Hospital(US)
- National University Health System(SG)
- Istituti Clinici Scientifici Maugeri(IT)
- Istituti di Ricovero e Cura a Carattere Scientifico(IT)
- University of Pavia(IT)
- Michigan Medicine(US)
- Universitat Politècnica de València(ES)