Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Analyzing the performance of <scp>ChatGPT</scp> in answering inquiries about cervical cancer
16
Zitationen
3
Autoren
2024
Jahr
Abstract
OBJECTIVE: To analyze the knowledge of ChatGPT about cervical cancer (CC). METHODS: Official websites of professional health institutes, and websites created by patients and charities underwent strict screening. Using CC-related keywords, common inquiries by the public and comments about CC were searched in social media applications with these data, a list of frequently asked questions (FAQs) was prepared. When preparing question about CC, the European Society of Gynecological Oncology (ESGO), European Society for Radiotherapy and Oncology (ESTRO), and European Society of Pathology (ESP) guidelines were used. The answers given by ChatGPT were scored according to the Global Quality Score (GQS). RESULTS: When all ChatGPT answers to FAQs about CC were evaluated with regard to GQS, 68 ChatGPT answers were classified as score 5, and none of ChatGPT answers for FAQs were scored as 2 or 1. Moreover, ChatGPT answered 33 of 53 (62.3%) CC-related questions based on ESGO, ESTRO, and ESP guidelines with completely accurate and satisfactory responses (GQS 5). In addition, eight answers (15.1%), seven answers (13.2%), four answers (7.5%), and one answer (1.9%) were categorized as GQS 4, GQS 3, GQS 2, and GQS 1, respectively. The reproducibility rate of ChatGPT answers about CC-related FAQs and responses about those guideline-based questions was 93.2% and 88.7%, respectively. CONCLUSION: ChatGPT had an accurate and satisfactory response rate for FAQs about CC with regards to GQS. However, the accuracy and quality of ChatGPT answers significantly decreased for questions based on guidelines.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.740 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.649 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.202 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.886 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.