A Machine Learning Model Integrating Preoperative Blood-Based Indices for Early and Noninvasive Detection of Endometrial Cancer

Abstract

BACKGROUND: The incidence of endometrial cancer (EC) is rising globally, yet early diagnosis remains challenging. Our objective was to develop a noninvasive, preoperative tool to predict EC risk using machine learning (ML) techniques.

METHODS: This retrospective analysis included clinical data from patients with endometrial lesions at the Third Affiliated Hospital of Sun Yat-sen University between January 2014 and August 2024. Six ML techniques, namely Random Forest (RF), Extreme Gradient Boosting (XGBoost), Support Vector Machine (SVM), Gradient Boosting Decision Tree (GBDT), Logistic Regression (LR), and Multi-Layer Perceptron (MLP), were used to construct prediction models for EC. Receiver operating characteristic (ROC) curves were used to evaluate the models. SHapley Additive exPlanations (SHAP) analysis was applied to determine the predictive role of each feature in the model with the highest predictive performance.

RESULTS: A total of 857 patients were included in the study. Eight baseline characteristics (age, BMI, gravidity, parity, family history, menopausal status, diabetes, and hypertension), one imaging feature (endometrial thickness), and eight peripheral blood-based markers (WBC, NLR, MLR, PLR, SII, SIRI, CA-125, and HE4) were selected to develop and validate the ML models; all of these features were obtained noninvasively. Data from 686 patients were randomly assigned to the training group, and data from 171 patients were used for internal validation. Among the six ML models, GBDT had the highest predictive performance, achieving an AUC of 0.95 (95% CI: 0.93-0.97), an accuracy of 90.0%, and a Brier score of 0.06. SHAP analysis showed that HE4, CA-125, and SIRI were the most influential contributors to the prediction.

CONCLUSION: We developed and validated a GBDT prediction model, which showed the best performance among the six models in predicting EC. This model can be applied in clinical practice to predict the risk of EC in patients.
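
The abstract does not define the five ratio-based blood markers or the evaluation metrics, so the following is a minimal sketch under stated assumptions, not the authors' code. It assumes the commonly cited definitions: NLR is neutrophils/lymphocytes, MLR is monocytes/lymphocytes, PLR is platelets/lymphocytes, SII is platelets × neutrophils/lymphocytes, and SIRI is neutrophils × monocytes/lymphocytes. It also assumes all counts are absolute values in 10^9 cells/L. The names `BloodCounts`, `inflammatory_indices`, `brier_score`, and `roc_auc` are hypothetical, and the example values are illustrative rather than study data.

```python
from dataclasses import dataclass


@dataclass
class BloodCounts:
    """Absolute counts from a complete blood count (assumed unit: 10^9 cells/L)."""
    neutrophils: float
    lymphocytes: float
    monocytes: float
    platelets: float


def inflammatory_indices(c: BloodCounts) -> dict:
    """Derive NLR, MLR, PLR, SII and SIRI using their commonly cited definitions."""
    if c.lymphocytes <= 0:
        raise ValueError("lymphocyte count must be positive")
    return {
        "NLR": c.neutrophils / c.lymphocytes,
        "MLR": c.monocytes / c.lymphocytes,
        "PLR": c.platelets / c.lymphocytes,
        "SII": c.platelets * c.neutrophils / c.lymphocytes,
        "SIRI": c.neutrophils * c.monocytes / c.lymphocytes,
    }


def brier_score(y_true, y_prob) -> float:
    """Mean squared gap between predicted probability and outcome (0 is perfect)."""
    if len(y_true) != len(y_prob) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def roc_auc(y_true, y_prob) -> float:
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Illustrative values only, not taken from the study.
    counts = BloodCounts(neutrophils=4.0, lymphocytes=2.0, monocytes=0.5, platelets=250.0)
    print(inflammatory_indices(counts))

    labels = [1, 1, 0, 0]            # 1 = EC, 0 = benign lesion (hypothetical)
    probs = [0.9, 0.6, 0.7, 0.1]     # hypothetical model outputs
    print(brier_score(labels, probs), roc_auc(labels, probs))
```

The pairwise AUC is quadratic in the number of cases. That is acceptable for a validation set of a few hundred patients, but a rank-based implementation would be preferable at scale.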
