Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Transforming a clinical study database into a structured database adapted to artificial intelligence applications
0
Zitationen
17
Autoren
2026
Jahr
Abstract
OBJECTIVE: Medical imaging databases suitable for training machine learning/computer vision algorithms are scarce, limiting the potential for development and generalisation of clinical tools. Clinical trial databases are a source of data, known for their high-quality data and reliable annotations. However, they are not tailored to the needs of machine learning or deep learning models. Our objective was to develop a methodology and tools that enable the curation of these databases specifically for the training or testing of artificial intelligence tools. MATERIALS AND METHODS: MRIs from the French centres of the EURAD clinical trial (MRI of women with pelvic adnexal lesions) were used to constitute the database. We developed the steps required to curate a clinical trial database: definition of inclusion and exclusion criteria, removal of unnecessary data according to the principle of parsimony, quality control, and harmonisation. RESULTS: A total of 713 patients were included in our study. The directory structure was simplified, and the number of files and folders decreased by 44% and 95% respectively. Only 62 DICOM fields were considered necessary for artificial intelligence (AI) model applications. Quality control was implemented in repeated cycles of automatic checks, followed by a final manual random inspection. Finally, sequence names were harmonised for easy identification when developing models. CONCLUSION: Using a clinical trial database, we propose a methodology to build a database suitable to train or test AI algorithms. This study underlines the need for a more global and systematic framework for the secondary use of health data to develop AI imaging tools for patient care. CRITICAL RELEVANCE STATEMENT: We propose and detail a framework and tools to curate a clinical trial database to allow secondary use of the high-quality annotated data generated in clinical trials for the training and testing of artificial intelligence models. KEY POINTS: Clinical trial imaging databases are not adapted for AI model development. A curation process of MRI databases was developed for machine learning applications. We share the open-source tools and methodology developed in this study.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.674 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.583 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.105 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.862 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.
Autoren
Institutionen
- Université Paris Cité(FR)
- Laboratoire Informatique Paris Descartes(FR)
- Philips (France)(FR)
- Sorbonne Université(FR)
- Assistance Publique – Hôpitaux de Paris(FR)
- Hôpital Tenon(FR)
- Centre Hospitalier de Valenciennes(FR)
- Institut Paoli-Calmettes(FR)
- Université de Montpellier(FR)
- Hôpital de la Timone(FR)
- Institut Gustave Roussy(FR)
- Institut Curie(FR)
- Inserm(FR)
- Hôpital Européen Georges-Pompidou(FR)
- Hôpital Européen(FR)
- Paris Cardiovascular Research Center(FR)