Abstract
BACKGROUND: Early identification of infants with necrotizing enterocolitis (NEC) at risk of surgery is essential for an effective treatment. This study aims to clarify the risk factors of surgical NEC and establish a prediction model by machine learning algorithm. METHODS: Infants with NEC were split into two groups based on whether they had surgery or not. Clinical data was collected and compared between the groups. Variables were analyzed with one-way logistic regression and predictive models were built using logistic regression and CatBoost algorithm. The models were evaluated and compared using Receiver Operating Characteristic (ROC) curves and feature importance. Feature importance was ranked using the SHapley Additive exPlanation method and model optimization was performed using feature culling. Final model was selected and a user-friendly GUI software was created for clinical use. RESULTS: The Catboost model performed better than the logistic regression model in terms of discriminative power. An interpretable final model with 14 features was built after the features were reduced according to the feature importance level. The final model accurately identified Surgicel NEC in the internal validation (AUC = 0.905) and was translated into a convenient tool to facilitate its use in clinical settings. CONCLUSIONS: Catboost machine learning model related to infants with surgical NEC was successfully developed. A GUI interface was developed to assist clinicians in accurately identifying children who would benefit from surgery.