Abstract
PURPOSE: Collecting information on clinical outcomes (recurrence/progression) from complex treatment courses in non-muscle invasive bladder cancer (NMIBC) is challenging and time-consuming. We developed a deep learning natural language processing model to assess outcomes in patients with NMIBC using vast data from electronic health records (EHRs). METHODS: This retrospective study analyzed data from Japanese adults with NMIBC who started Bacillus Calmette-Guérin (BCG) induction therapy between April 2016 and June 2022. A Bidirectional Encoder Representations from Transformers (BERT) model was trained to classify outcomes, supported by human review for past history records. The model's performance was assessed by precision, recall, and F1 scores. We compared the effectiveness of BCG therapy between completion (patients who completed therapy) and non-completion groups. RESULTS: Of 372 patients studied, 79.3% and 20.7% were in the completion group and the non-completion group, respectively. The final BERT model achieved average F1 scores of 0.91 and 0.98 for time to recurrence (TTR), and 0.74 and 0.94 for time to progression (TTP) before and after human support, respectively. The hazard ratio for TTR in BCG completion versus non-completion groups was 0.40 (95% CI, 0.26 to 0.62) by a multivariate Cox proportional hazard model and 0.41 (95% CI, 0.26 to 0.63) by inverse probability of treatment weighting. CONCLUSION: The developed model could compare the clinical outcomes between treatments in patients with NMIBC using EHRs. Human support, although required, was needed in only 10% documents and was deemed feasible. The model was able to demonstrate the difference in TTR and TTP between BCG completion and non-completion groups.