Abstract
BACKGROUND: In recent years, the fibroblast growth factor receptor (FGFR) pathway has been proven to be an important therapeutic target in bladder cancer. FGFR-targeted therapies are effective for patients with FGFR mutation, which can be discovered through genetic sequencing. However, genetic sequencing is not commonly performed at diagnosis, whereas a histologic assessment of the tumor is. We aim to computationally extract imaging biomarkers from existing tumor diagnostic slides in order to predict FGFR alterations in bladder cancer. METHODS: This study analyzed genomic profiles and H&E-stained tumor diagnostic slides of bladder cancer cases from The Cancer Genome Atlas (n = 418 cases). A convolutional neural network (CNN) identified tumor-infiltrating lymphocytes (TIL). The percentage of the tissue containing TIL ("TIL percentage") was then used to predict FGFR activation status with a logistic regression model. RESULTS: This predictive model could proficiently identify patients with any type of FGFR gene aberration using the CNN-based TIL percentage (sensitivity = 0.89, specificity = 0.42, AUROC = 0.76). A similar model which focused on predicting patients with only FGFR2/FGFR3 mutation was also found to be highly sensitive, but also specific (sensitivity = 0.82, specificity = 0.85, AUROC = 0.86). CONCLUSION: TIL percentage is a computationally derived image biomarker from routine tumor histology that can predict whether a tumor has FGFR mutations. CNNs and other digital pathology methods may complement genome sequencing and provide earlier screening options for candidates of targeted therapies.