AUTOMATING RENAL CANCER CHART REVIEW USING LARGE LANGUAGE MODELS

Nicholas Heller; Angelica Bartholomew; Clara Goebel; Rikhil Seshadri; Beatriz Lopez Morato; Gabriel Wallerstein-king; Betty Wang; Jayant Siva; Jason Scovell; Rebecca Campbell; Michal Ozery-Flato; Vesna Resende Barros; Maria Gabrani; Michal Rosen-Zvi; Ryan Ward; Steven Campbell; Erick Remer; Christopher Weight; Robert Abouassaly

doi:10.1016/j.urolonc.2024.12.144

Urologic Oncology: Seminars And Original Investigations

Paper

01 Mar 2025

AUTOMATING RENAL CANCER CHART REVIEW USING LARGE LANGUAGE MODELS

Abstract

Clinical databases are essential for clinical and translational research. Traditionally, curating a clinical database involves manually collecting data from free text notes within the electronic medical record (EMR), but this process is time-consuming and error prone. Recently, Large Language Models (LLMs) such as OpenAI's ChatGPT and Google's Gemini have demonstrated impressive semantic understanding of free text, and could be used to automate the free text data extraction tasks that once could only be done using human experts and trainees. Unfortunately, these free text notes often contain protected health information, and moreover embody a valuable asset, leading health systems to restrict their transfer to entities like the third party AI providers mentioned above. The goal of this study is to evaluate the feasibility of avoiding data transfer by using an open source AI model to generate a clinical database of kidney cancer patients from free text radiology, pathology, and operative notes.

Paper