Abstract
BACKGROUND: The heterogeneity of inflammatory bowel disease (IBD) and its unpredictable course have always been a challenge for gastroenterologists, with regard to predicting the disease response using endoscopic techniques. Machine learning (ML) models have shown some early promise in predicting treatment response in IBD patients. METHODS: We conducted a systematic review of studies investigating the application of ML to predict treatment response and remission in IBD patients. We used the CHARMS checklist for data extraction. Bias was assessed with the PROBAST tool. RESULTS: We included in our review 6 studies that evaluated numbers of IBD patients ranging from 67 to 3004. ML models demonstrated low to moderate predictive accuracy for treatment response and remission (area under the receiver operating characteristic curve: 0.489-0.811; sensitivity: 0.46-0.96; specificity: 0.56-0.98). The studies that utilized ML models with more input variables performed better. Furthermore, only 2 studies performed external validation, and half of the studies demonstrated a substantial risk of bias due to missing data/overfitting, and variability in outcome definition. CONCLUSIONS: ML models show considerable promise in predicting treatment outcomes and remission in IBD. However, given the substantial bias in studies so far, future studies should use a standardized methodology, external validation, and an interpretable broader input variable.