Abstract
BACKGROUND: Cystitis glandularis (CG) is a rare urological condition characterized by glandular metaplasia of the bladder mucosa. Recurrence following transurethral resection (TUR) is a significant clinical challenge. Traditional predictive models often fail to capture the complexity of the data, resulting in insufficient accuracy. In contrast, machine learning (ML) has demonstrated substantial potential in medical prediction by identifying and analyzing complex patterns that are undetectable by conventional methods. This study aims to develop and evaluate an interpretable ML model to predict recurrence after TUR for CG, thereby improving clinical decision-making and patient outcomes. METHODS: We analyzed predictors of recurrence using the least absolute shrinkage and selection operator (LASSO) and multivariate logistic regression. We developed and tested seven ML-based models: Cox proportional hazards model (CoxPH), LASSO regression, decision tree (rpart), random survival forest (RSF), gradient boosting machine (GBM), support vector machine (SVM), and extreme gradient boosting (XGBoost). Participants were diagnosed with CG by pathology following TUR and treated from 2012 to 2018. Model discrimination was assessed using the receiver operating characteristic (ROC) curve and area under the ROC curve (AUC), while model preference was evaluated through the Brier score (BS). Decision curve analysis (DCA) was used for model comparison. The SHapley Additive exPlanations (SHAP) method was employed for interpretation, providing insights into recurrence prediction and prevention strategies. Finally, user-friendly platform was developed, allowing users to predict CG recurrence by entering feature values into designated text boxes on the webpage. RESULTS: The RSF model demonstrated the best performance in predicting recurrence, as indicated by superior ROC, DCA, and BS metrics. In SHAP, postoperative regular instillation (PRI) contributed the most to model construction. CONCLUSIONS: The RSF model effectively predicts CG recurrence, offering a framework for individualized treatment strategies. PRI was identified as the most significant risk factor influencing recurrence.