Development and validation of a nomogram model of lung metastasis in breast cancer based on machine learning algorithm and cytokines

基于机器学习算法和细胞因子的乳腺癌肺转移列线图模型的开发与验证

阅读:1

Abstract

BACKGROUND: The relationship between cytokines and lung metastasis (LM) in breast cancer (BC) remains unclear and current clinical methods for identifying breast cancer lung metastasis (BCLM) lack precision, thus underscoring the need for an accurate risk prediction model. This study aimed to apply machine learning algorithms for identifying the key risk factors for BCLM before developing a reliable prediction model centered on cytokines. METHODS: This population-based retrospective study included 326 BC patients admitted to the Second Affiliated Hospital of Xuzhou Medical University between September 2018 and September 2023. After randomly assigning the patients to a training cohort (70%; n = 228) or a validation cohort (30%; n = 98) the risk factors for BCLM were identified using Least Absolute Shrinkage and Selection Operator (LASSO), Extreme Gradient Boosting (XGBoost) and Random Forest (RF) models. Significant risk factors were visualized with a Venn diagram and incorporated into a nomogram model, the performance of which was then evaluated according to three criteria, namely discrimination, calibration and clinical utility using calibration plots, receiver operating characteristic (ROC) curves and decision curve analysis (DCA). RESULTS: Among the cohort, 70 patients developed LM. A nomogram was then developed to predict the 5-year and 10-year BCLM risk by incorporating five key variables, namely endocrine therapy, hsCRP, IL6, IFN-ɑ and TNF-ɑ. For the 5-year prediction model, the training and validation cohorts had AUC values of 0.786 (95% CI: 0.691-0.881) and 0.627 (95% CI: 0.441-0.813), respectively, while for the 10-year prediction model, the corresponding AUC values were 0.687 (95% CI: 0.528-0.847) and 0.797 (95% CI: 0.605-0.988), respectively. ROC analysis further confirmed the model's strong discriminative ability, while calibration plots indicated that the predicted and observed outcomes were in good agreement in both cohorts. Finally, DCA demonstrated the model's effectiveness in clinical practice. CONCLUSION: Using machine learning algorithms, this study developed aa nomogram that could effectively identify BC patients who were at a higher risk of developing LM, thus providing a valuable tool for decision-making in clinical settings.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。