Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Abstract 5507: A user-friendly, no-code, application for HIPAA-compliant automated analysis of tabular data at scale.
0
Zitationen
1
Autoren
2026
Jahr
Abstract
Abstract Much research in clinical settings involves review of large quantities of often unstructured data in electronic medical records, which is time consuming and requires an educated workforce of clinical research coordinators, residents and/or fellows. Large language models, such as ChatGPT, have demonstrable capabilities for efficiently analyzing and summarizing large volumes of text, offering the potential to greatly accelerate chart review-based research in clinical settings. Concerns regarding leakage of personal health information (PHI) sharply limit the potential to utilize commercial chatbots for this purpose. Secure, fire-walled in-house LLM chatbots in a well-governed system can solve concerns related to PHI leakage. Although processing data for large studies one-prompt-at-a-time may offer substantial efficiency gains versus prior human-only processes, far greater efficiencies can be achieved by automating the prompt submission and retrieval process. To address these concerns, we developed a user-friendly application for processing any form of tabular data at scale using an in-house implementation of ChatGPT 4o-mini on Emplify Health’s Azure cloud environment. The application imports tabular data (in excel or text format), guides the user through a straightforward prompt engineering process and then automatically submits the data, row-by-row, to the LLM and retrieves and tabulates the returning data in the manner specified by the user. The application was initially developed with the goal of rapidly reasoning through thousands of pathology reports to identify those which correspond to cancer cases and, for those cases, automatically extracting multiple cancer-related parameters. Following this early success, we developed a completely generalized data agnostic application which has found widespread utility within our research institute, being deployed on diverse data sources such as cancer registries, cardiology CT reports, imaging narratives and clinical encounter notes. In this way, the application has substantially accelerated research projects by dramatically reducing the time needed to retrieve data, enabling our personnel to spend more time on higher yield activities such as data analysis. Citation Format: Paraic A. Kenny, . A user-friendly, no-code, application for HIPAA-compliant automated analysis of tabular data at scale [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2026; Part 1 (Regular Abstracts); 2026 Apr 17-22; San Diego, CA. Philadelphia (PA): AACR; Cancer Res 2026;86(7 Suppl):Abstract nr 5507.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.553 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.444 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.943 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.792 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.