Development and external validation of models to improve prediction of osteoporosis in elderly women: interpretable machine learning

Authors: Tang, Tian; Wang, Shiwen; Cai, Shengziyi; Hu, Yun

Journal:	Frontiers in Endocrinology	Impact factor:	4.600
Year:	2025	Citation:	2025;16:1719698
doi:	10.3389/fendo.2025.1719698	Research area:	Metabolism
Disease type:	Osteoporosis

Abstract

INTRODUCTION: As populations age and the prevalence of osteoporosis (OP) increases, osteoporotic fractures substantially raise disability and mortality and impose growing economic burdens, threatening health and quality of life. This study aimed to develop and externally validate a reliable, practical machine learning model to predict OP in older women using routine clinical test results and comorbidity data.

METHODS: We retrospectively assembled an internal dataset from NHANES (2003-2020) and randomly split it 70:30 into training and test sets. An external cohort from a Chinese tertiary hospital was used for validation. Predictors were selected using LASSO in the training set. Five algorithms (XGBoost, SVM, RF, LightGBM, and Naive Bayes) were tuned, and model performance was evaluated on the test set and in the external cohort. Calibration curves and decision curve analysis (DCA) were used to assess calibration and clinical net benefit. Feature contributions were quantified with Shapley additive explanations (SHAP).

RESULTS: Among 3,950 women in the internal dataset, 833 (21.1%) had OP; in the external cohort (n=338), 167 (49.4%) had OP. SHAP ranked predictors (high to low) as: age, drinking, diabetes, eGFR, HbA1c, BMI, HDL, TG, BUN, and TBIL. After hyperparameter tuning, RF achieved an AUC of 0.805 in the internal test set and 0.740 in the external cohort; in the internal test set, accuracy was 0.82, precision 0.83, and specificity 0.97. Calibration was acceptable, and DCA indicated clinical utility across relevant thresholds.

CONCLUSION: A random forest model using readily available clinical data predicts osteoporosis risk in older women with robust internal and external performance. The deployed model outputs calibrated probabilities at the patient level, provides case-level explanations using SHAP, and supports dynamic rescoring as new routine results become available, enabling individualized risk management in routine care.
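The abstract reports accuracy, precision, and specificity at a classification threshold and summarizes clinical utility with DCA, but it gives neither the threshold nor the formulas. The following is a minimal sketch of how such quantities are commonly computed, assuming the standard net-benefit definition (true-positive rate per patient minus false-positive rate per patient weighted by the threshold odds). The function names, the toy labels, and the toy probabilities are hypothetical and are not taken from the study.

```python
from typing import Dict, Sequence, Tuple


def confusion_counts(
    y_true: Sequence[int], y_prob: Sequence[float], threshold: float
) -> Tuple[int, int, int, int]:
    """Return (TP, FP, TN, FN), predicting OP when probability >= threshold."""
    tp = fp = tn = fn = 0
    for label, prob in zip(y_true, y_prob):
        predicted_positive = prob >= threshold
        if predicted_positive and label == 1:
            tp += 1
        elif predicted_positive:
            fp += 1
        elif label == 0:
            tn += 1
        else:
            fn += 1
    return tp, fp, tn, fn


def threshold_metrics(
    y_true: Sequence[int], y_prob: Sequence[float], threshold: float
) -> Dict[str, float]:
    """Accuracy, precision, and specificity at one threshold."""
    tp, fp, tn, fn = confusion_counts(y_true, y_prob, threshold)
    return {
        "accuracy": (tp + tn) / len(y_true),
        "precision": tp / (tp + fp) if (tp + fp) else float("nan"),
        "specificity": tn / (tn + fp) if (tn + fp) else float("nan"),
    }


def net_benefit(
    y_true: Sequence[int], y_prob: Sequence[float], threshold: float
) -> float:
    """Net benefit of the model at a threshold probability (0 < threshold < 1)."""
    tp, fp, _, _ = confusion_counts(y_true, y_prob, threshold)
    n = len(y_true)
    return tp / n - (fp / n) * threshold / (1 - threshold)


def net_benefit_treat_all(y_true: Sequence[int], threshold: float) -> float:
    """Net benefit of the reference strategy that flags every patient."""
    prevalence = sum(y_true) / len(y_true)
    return prevalence - (1 - prevalence) * threshold / (1 - threshold)


# Hypothetical toy data: 10 patients, 3 with OP (not study data).
toy_labels = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
toy_probs = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2, 0.2, 0.1, 0.1, 0.05]

if __name__ == "__main__":
    print(threshold_metrics(toy_labels, toy_probs, 0.5))
    # A decision curve plots these values over a range of thresholds.
    for pt in (0.1, 0.2, 0.3, 0.4, 0.5):
        model_nb = net_benefit(toy_labels, toy_probs, pt)
        all_nb = net_benefit_treat_all(toy_labels, pt)
        print(f"threshold={pt:.1f}  model={model_nb:.3f}  treat_all={all_nb:.3f}")
```

In a decision curve, a model is considered useful at a given threshold when its net benefit exceeds both the treat-all line and zero (the treat-none strategy). Which thresholds count as clinically relevant is a judgment the abstract does not specify.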
