Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
From knowledge generation to knowledge verification: examining the biomedical generative capabilities of ChatGPT
7
Zitationen
4
Autoren
2025
Jahr
Abstract
The generative capabilities of LLM models offer opportunities for accelerating tasks but raise concerns about the authenticity of the knowledge they produce. We present a computational approach that evaluates the factual accuracy of biomedical knowledge generated by an LLM. Our approach consists of generating disease-centric associations and verifying them using biomedical ontologies. Using ChatGPT, we designed prompt-engineering processes to establish linkages between diseases and related drugs, symptoms, and genes, and assessed consistency across multiple ChatGPT models (e.g., GPT-4, GPT-4o, and GPT-4o-mini). Results demonstrate high accuracy in identifying disease terms (88%-97%), drug names (90%-91%), and genetic information (88%-98%). Symptom term identification was lower (49%-61%) due to informal symptom descriptions. Verification reveals coverage of 89%-91% for disease-drug and disease-gene pairs; symptom-related associations show lower coverage (49%-62%). Despite high term accuracy, generated IDs were often invalid or redundant. GenAI tools can be reliable if used with care. Retrieval Augmented Generation (RAG) may enhance reliability.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.719 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.628 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.176 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.880 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.