Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Implications of the artificial intelligence extensions to the guidelines for consolidated standards of reporting trials and for standard protocol item recommendations for interventional trials (the CONSORT-AI and SPIRIT-AI extensions)
2
Zitationen
2
Autoren
2020
Jahr
Abstract
Artificial intelligence (AI), the theory and development of computer systems able of performing tasks which normally need the application of human intelligence, holds great promise for improving health outcomes and experiences [[1]Lee M.R. Sankar V. Hammer A. et al.Using machine learning to classify individuals with alcohol use disorder based on treatment seeking status.EClinicalMedicine. 2019; 12: 70-78https://doi.org/10.1016/j.eclinm.2019.05.008Summary Full Text Full Text PDF PubMed Scopus (14) Google Scholar,[2]Liu Q.P. Xu X. Zhu F.P. et al.Prediction of prognostic risk factors in hepatocellular carcinoma with transarterial chemoembolization using multi-modal multi-task deep learning.EClinicalMedicine. 2020; 23100379https://doi.org/10.1016/j.eclinm.2020.100379Summary Full Text Full Text PDF PubMed Scopus (11) Google Scholar]. However, there is some anxiety around the safety and transparency (or ‘black box’) of AI systems which may impede their integration into healthcare and society more widely. The timely release of the CONSORT and SPIRIT extensions for interventions involving artificial intelligence is welcome, and promises to improve the quality of RCTs involving AI [[3]Liu X. Cruz Rivera S. Moher D. Calvert M.J. Denniston A.K. the SPIRIT-AI and CONSORT-AI Working GroupReporting guidelines for clinical trial reports for interventions involving artificial intelligence: the CONSORT-AI extension.Lancet Dig Health. 2020; (In print)Summary Full Text Full Text PDF PubMed Scopus (16) Google Scholar,[4]Cruz Rivera S. Liu X. Chan A.W. Denniston A.K. and the SPIRIT-AI and CONSORT-AI Working GroupGuidelines for clinical trial protocols for interventions involving artificial intelligence: the SPIRIT-AI extension.Lancet Dig Health. 2020; (In print)Summary Full Text Full Text PDF Scopus (24) Google Scholar]. Synthesis of Randomised controlled trials (RCTs) through systematic review and meta-analysis, championed by the Cochrane Collaboration, permits critical overview of the literature, but this ‘secondary research’ is dependant on the quality of individual studies. In 1996, in recognition of the poor quality of many reported RCTs, a multinational expert working group published the Consolidated Standards of Reporting Trials (CONSORT) Statement, a set of evidence based recommendations for reporting randomized trials [[5]CONSORT. Consolidated standards of reporting trials (CONSORT) statement Available from: http://www.consort-statement.org/, Accessed 18th August 2020Google Scholar]. The CONSORT statement and accompanying Standard Protocol Items: Recommendations for Interventional Trials (SPIRIT) Statement have evolved and extended alongside changes in trial design, and have been endorsed by over 585 journals [[5]CONSORT. Consolidated standards of reporting trials (CONSORT) statement Available from: http://www.consort-statement.org/, Accessed 18th August 2020Google Scholar]. The recent initiative to develop AI-elaboration items for RCTs followed a rigorous consensus process, adopting the EQUATOR Network's methodological framework. Literature review was followed by expert consultation and 2-stage Delphi process involving 103 international stakeholders (journal editors, peer reviewers, pharmaceutical companies, regulatory bodies, academic institutions, funding agencies, clinicians). The CONSORT-AI and SPIRIT-AI elaborations add 14 items under 6 sections, and 15 items under 7 sections, respectively. Specifically, new items within the extensions include: clarity in the manuscript title and background rationale about the intended use of the AI application within a clinical pathway; the requirements for integration into the trial setting; the eligibility criteria both for participants and input data, including how poor quality or unavailable data were assessed and handled and whether there was human-AI interaction in handling the input data; specifying the intervention and protocol for its use and application to decision making or other areas of clinical practice precisely; detailing plans to identify and analyse performance errors; and reporting access to the AI intervention and/or its code. Importantly, commentary reported within the Delphi process revealed that the stakeholder panel appreciated the profound importance of the challenge presented by integration of AI into clinical trials, highlighting unpredictable errors “which are not easily detectable or explainable by human judgement.” The stakeholder panel recognised the potential ease with which AI systems could be deployed at scale, and related concerns that, “unintended harmful consequences could be catastrophic.” A particularly important addition resulting from this process was CONSORT-AI item 19, recommending analysis of systematic performance errors by the algorithm and their consequences. Furthermore, a notable exclusion from this initiative was ‘continuously evolving’ AI systems which are currently in early development, with few tangible examples in healthcare applications. The panel identified the risks inherent in incremental software changes, which could impact safety performance, and highlighted a need for rigorous software version management and post-deployment surveillance. The CONSORT-AI and SPIRIT-AI elaborations represent a very important and timely advance towards enhancing the quality of study design and reporting for new AI interventions, and supporting the wider community in their transparent evaluation, including consideration of risk of bias. However, it is worth remembering that even with improved quality of RCT design and reporting [[6]Moher D. Hopewell S. Schulz K.F. et al.CONSORT 2010 explanation and elaboration: updated guidelines for reporting parallel group randomised trials.Int J Surgery. 2012; 10: 28-55https://doi.org/10.1016/j.ijsu.2011.10.001Crossref PubMed Scopus (925) Google Scholar], few of the 20,000 RCT papers published annually translate into clinical benefit for the wider target population [[7]National Institute of Health. NCBI PubMed. Available from: https://pubmed.ncbi.nlm.nih.gov/, Accessed 18th August 2020Google Scholar,[8]Heneghan C. Goldacre B. Mahtani K.R Why clinical trial outcomes fail to translate into benefits for patients.Trials. 2017; 18: 122https://doi.org/10.1186/s13063-017-1870-2Crossref PubMed Scopus (140) Google Scholar]. In heralding the development of CONSORT-AI and SPIRIT-AI, we must not forget the multiple obstacles to the implementation of RCT findings. These challenges may be generic to RCTs, for example the selection of outcomes which are insufficiently patient centred or precise. They may also be particular to AI, for example the inadvertent propagation and magnification of health care disparities around gender, ethnicity and socioeconomic status [[9]Schwalbe N. Wahl B. Artificial intelligence and the future of global health.Lancet. 2020; 395: 1579-1586https://doi.org/10.1016/s0140-6736(20)30226-9Summary Full Text Full Text PDF PubMed Scopus (0) Google Scholar,[10]Noseworthy P.A. Attia Z.I. Brewer L.C. et al.Assessing and mitigating bias in medical artificial intelligence: the effects of race and ethnicity on a deep learning model for ECG analysis.Circul Arrhythmia Electrophysiolo. 2020; 13e007988https://doi.org/10.1161/circep.119.007988Crossref PubMed Google Scholar], the heterogeneity of real world health care data maturity preventing widespread deployment of the AI-based intervention [[9]Schwalbe N. Wahl B. Artificial intelligence and the future of global health.Lancet. 2020; 395: 1579-1586https://doi.org/10.1016/s0140-6736(20)30226-9Summary Full Text Full Text PDF PubMed Scopus (0) Google Scholar], or uncertainty over where moral accountability sits with regards to patient harm. Stakeholder involvement, and harmonisation of data estates across health care settings is central to the pathway to impact for AI based interventions. Nevertheless, harnessing the power of AI in order to develop interventions which are then rigorously assessed promises great benefit for patients and for population health. The authors have no conflicts of interest to disclose. AL Solebo received supported from the National Institute for Health Research Biomedical Research Centre (NIHR BRC) based at Great Ormond Street Hospital NHS Foundation Trust and UCL GOS Institute of Child Health, and through an NIHR Clinician Scientist award (CS-2018-18-ST2-005). T Braithwaite received support from patient charity Olivia's Vision.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.336 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.207 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.607 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.476 Zit.