Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Are Generative Pretrained Transformer 4 Responses to Developmental Dysplasia of the Hip Clinical Scenarios Universal? An International Review
7
Zitationen
20
Autoren
2024
Jahr
Abstract
OBJECTIVE: There is increasing interest in applying artificial intelligence chatbots like generative pretrained transformer 4 (GPT-4) in the medical field. This study aimed to explore the universality of GPT-4 responses to simulated clinical scenarios of developmental dysplasia of the hip (DDH) across diverse global settings. METHODS: Seventeen international experts with more than 15 years of experience in pediatric orthopaedics were selected for the evaluation panel. Eight simulated DDH clinical scenarios were created, covering 4 key areas: (1) initial evaluation and diagnosis, (2) initial examination and treatment, (3) nursing care and follow-up, and (4) prognosis and rehabilitation planning. Each scenario was completed independently in a new GPT-4 session. Interrater reliability was assessed using Fleiss kappa, and the quality, relevance, and applicability of GPT-4 responses were analyzed using median scores and interquartile ranges. Following scoring, experts met in ZOOM sessions to generate Regional Consensus Assessment Scores, which were intended to represent a consistent regional assessment of the use of the GPT-4 in pediatric orthopaedic care. RESULTS: GPT-4's responses to the 8 clinical DDH scenarios received performance scores ranging from 44.3% to 98.9% of the 88-point maximum. The Fleiss kappa statistic of 0.113 ( P = 0.001) indicated low agreement among experts in their ratings. When assessing the responses' quality, relevance, and applicability, the median scores were 3, with interquartile ranges of 3 to 4, 3 to 4, and 2 to 3, respectively. Significant differences were noted in the prognosis and rehabilitation domain scores ( P < 0.05 for all). Regional consensus scores were 75 for Africa, 74 for Asia, 73 for India, 80 for Europe, and 65 for North America, with the Kruskal-Wallis test highlighting significant disparities between these regions ( P = 0.034). CONCLUSIONS: This study demonstrates the promise of GPT-4 in pediatric orthopaedic care, particularly in supporting preliminary DDH assessments and guiding treatment strategies for specialist care. However, effective integration of GPT-4 into clinical practice will require adaptation to specific regional health care contexts, highlighting the importance of a nuanced approach to health technology adaptation. LEVEL OF EVIDENCE: Level IV.
Ähnliche Arbeiten
Radiological Assessment of Osteo-Arthrosis
1957 · 12.364 Zit.
Traumatic Arthritis of the Hip after Dislocation and Acetabular Fractures
1969 · 5.636 Zit.
Traumatic Arthritis of the Hip After Dislocation and Acetabular Fractures: Treatment by Mold Arthroplasty: An End-Result Study Using a New Method of Result Evaluation
2013 · 5.081 Zit.
Osteoarthritis
2019 · 4.530 Zit.
Burden of major musculoskeletal conditions.
2003 · 3.532 Zit.
Autoren
Institutionen
- China Medical University(CN)
- Hôpital Jeanne de Flandre(FR)
- Université d'Abomey-Calavi(BJ)
- Centre National Hospitalier et Universitaire Hubert Koutoukou MAGA(BJ)
- Stanford University(US)
- Morgan Stanley Children's Hospital(US)
- Bai Jerbai Wadia Hospital for Children(IN)
- Ospedale dei Bambini Vittore Buzzi(IT)
- Institute of Oncology Research(CH)
- Istituto Pediatrico della Svizzera Italiana
- Ufuk University(TR)
- University of Toronto(CA)
- Hospital for Sick Children(CA)
- SickKids Foundation(CA)
- Peking University(CN)
- Beijing Jishuitan Hospital(CN)
- Nagoya University(JP)
- National University Health System(SG)