The application of risk models based on machine learning to predict endometriosis-associated ovarian cancer in patients with endometriosis

应用基于机器学习的风险模型预测子宫内膜异位症患者发生子宫内膜异位症相关卵巢癌的风险

阅读:2

Abstract

INTRODUCTION: There is currently no satisfactory model for predicting malignant transformation of endometriosis. The aim of this study was to construct and evaluate a risk model incorporating noninvasive clinical parameters to predict endometriosis-associated ovarian cancer (EAOC) in patients with endometriosis. MATERIAL AND METHODS: We enrolled 6809 patients with endometriosis confirmed by pathology, and randomly allocated them to training (n = 4766) and testing cohorts (n = 2043). The proportion of patients with EAOC in each cohort was similar. We extracted a total of 94 demographic and clinicopathologic features from the medical records using natural language processing. We used a machine learning method - gradient-boosting decision tree - to construct a predictive model for EAOC and to evaluate the accuracy of the model. We also constructed a multivariate logistic regression model inclusive of the EAOC-associated risk factors using a back stepwise procedure. Then we compared the performance of the two risk-predicting models using DeLong's test. RESULTS: The occurrence of EAOC was 1.84% in this study. The logistic regression model comprised 10 selected features and demonstrated good discrimination in the testing cohort, with an area under the curve (AUC) of 0.891 (95% confidence interval [CI] 0.821-0.960), sensitivity of 88.9%, and specificity of 76.7%. The risk model based on machine learning had an AUC of 0.942 (95% CI 0.914-0.969), sensitivity of 86.8%, and specificity of 86.7%. The machine learning-based risk model performed better than the logistic regression model in DeLong's test (p = 0.036). Furthermore, in a prospective dataset, the machine learning-based risk model had an AUC of 0.8758, a sensitivity of 94.4%, and a specificity of 73.8%. CONCLUSIONS: The machine learning-based risk model was constructed to predict EAOC and had high sensitivity and specificity. This model could be of considerable use in helping reduce medical costs and designing follow-up schedules.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。