Development of machine learning-based prognostic models for small cell lung cancer with brain metastases: an analysis of SEER and Chinese populations

基于机器学习的小细胞肺癌脑转移预后模型开发:基于SEER和中国人群的分析

阅读:1

Abstract

BACKGROUND: Accurate survival prediction is critical for optimizing clinical management in small cell lung cancer (SCLC) patients with initial brain metastasis (BM). This study aims to develop a machine learning (ML)-based prognostic model to improve survival prediction, thereby enhancing clinical decision-making and treatment outcomes. METHODS: Data on SCLC patients with initial BM were extracted from the Surveillance, Epidemiology, and End Results (SEER) database for the period from 2010 to 2020 and split into training (70%) and testing (30%) sets. We used COX proportional hazards regression model to select modeling features in the training set. Four ML methods, including least absolute shrinkage and selection operator (LASSO), random forest (RF), eXtreme Gradient Boosting (XGBoost), and gradient boosting machine (GBM), were used to establish prognostic models and validated by both the SEER database and patients from Harbin Medical University Cancer Hospital. Model performance was evaluated using area under the curve (AUC) values, accuracy and F1 scores. Furthermore, we conducted a prioritization of the importance of features by SHapley Additive exPlanations (SHAP) at various time points. RESULTS: A total of 4,227 patients were enrolled in the SEER database, with 2,958 cases allocated to the training set and 1,269 to the testing set. Based on the results of univariate and multivariate COX regression analyses, we identified age, sex, marital, primary tumor size, N stage, bone metastasis, liver metastasis, lung metastasis, months from diagnosis to therapy, surgery, radiotherapy, and chemotherapy as model features. The LASSO model outperformed other models, with AUCs of 0.771, 0.724, 0.753, 0.718 at 6 months, 1 year, 2 years, and 3 years in the testing set, and 0.801, 0.763, 0.838, and 0.900 in the external validation set. Additionally, feature importance analysis consistently identified liver metastasis (highest rank), surgery, radiotherapy, lung metastasis, and chemotherapy as key predictors across all time points. CONCLUSIONS: The LASSO model demonstrated high accuracy in predicting survival for SCLC patients with initial BM, particularly in external validation. This model may provide valuable prognostic insights for personalized treatment strategies.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。