Development of a machine learning-based predictive model for osteoporosis risk and its application in clinical decision support

开发基于机器学习的骨质疏松风险预测模型及其在临床决策支持中的应用

阅读:1

Abstract

OBJECTIVE: This study was aimed at developing an interpretable machine learning model for predicting osteoporosis (OP) risk using real-world clinical data, and at establishing a web-based visualization tool for assisting clinical decision-making. METHODS: A total of 5,328 individuals from the Affiliated Hospital of Jiangxi University of Chinese Medicine (2015-2024) were included. Multidimensional data, including demographic characteristics, anthropometric measures, lumbar spine bone mineral density (L1-L4), and more than 90 blood biochemical and inflammatory markers, were collected. Key variables were identified using univariate analysis followed by least absolute shrinkage and selection operator (LASSO) regression. Five machine learning algorithms-Decision Tree, Random Forest, XGBoost, CatBoost, and Multi-Layer Perceptron (MLP)-were developed and compared. SHapley Additive exPlanations (SHAP) analysis was conducted to enhance model interpretability, and a web-based tool was subsequently developed based on the best-performing model. RESULTS: Five key predictive variables-age, sex, body mass index (BMI), uric acid (UA), and alkaline phosphatase (ALP)-were ultimately selected. Among the five models evaluated, the Random Forest model achieved the highest AUC (0.759) in the test set, demonstrating moderate discriminative performance and good model stability. SHAP analysis revealed that BMI contributed most to the model's predictions, while increased age, female sex, elevated ALP, and reduced UA were associated with a higher risk of osteoporosis. Based on this model, a web-based tool was developed to enable individualized risk prediction and feature-level visualization, providing a quantitative reference for clinical risk assessment. CONCLUSION: The osteoporosis prediction model developed in this study achieved quantitative risk estimation and interpretable outputs using a limited set of features, providing a feasible technical approach for early screening of osteoporosis. Future work should focus on external validation and recalibration in multicenter populations to further evaluate and optimize the model's predictive performance and clinical applicability.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。