[Efficacy of machine learning models versus Cox regression model for predicting prognosis of esophagogastric junction adenocarcinoma]

[机器学习模型与Cox回归模型在预测食管胃交界处腺癌预后方面的疗效比较]

阅读:1

Abstract

OBJECTIVE: To compare the performance of machine learning models and traditional Cox regression model in predicting postoperative outcomes of patients with esophagogastric junction adenocarcinoma (AEG). METHODS: This study was conducted among 203 AEG patients with complete clinical and follow-up data, who were treated in our hospital between September, 2015 and October, 2020. The clinicopathological data of the patients were processed for analysis using R language package and divided into training and validation datasets at the ratio of 3:1. The Cox proportional hazards regression model and 4 machine learning models were constructed for analyzing the datasets. ROC curves, calibration curves and clinical decision curves (DCA) were plotted. Internal validation of the machine learning models was performed to assess their predictive efficacy. The predictive performance of each model was evaluated by calculating the area under the curve (AUC), and the model fitting was assessed using the calibration curve. RESULTS: For predicting 3-year survival based on the validation dataset, the AUC was 0.870 for Cox proportional hazard regression model, 0.901 for eXtreme Gradient Boosting (XGBoost), 0.791 for random forest, 0.832 for support vector machine, and 0.725 for multilayer perceptron; For predicting 5-year survival, the AUCs of these models were 0.915, 0.916, 0.758, 0.905, and 0.737, respectively. For internal validation, the AUCs of the 4 machine learning models decreased in the order of XGBoost (0.818), random forest (0.758), support vector machine (0.0.804), and multilayer perceptron (0.745). CONCLUSION: The machine learning models show better predictive efficacy for survival outcomes of patients with AEG than Cox proportional hazard regression model, especially when proportional odds assumption or linear regression models are not applicable. XGBoost models have better performance than the other machine learning models, and the multi-layer perception model may have poor fitting results for a limited data volume.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。