Machine learning approaches to predict hip fracture incidence: insights from the CHARLS dataset

利用机器学习方法预测髋部骨折发生率:来自 CHARLS 数据集的启示

阅读:2

Abstract

BACKGROUND: Hip fractures are a major health concern in the older adults, severely impacting patients' quality of life and straining healthcare systems. With China's aging population, their incidence is projected to increase. Thus, developing effective prediction models to identify high-risk individuals is essential for prevention. OBJECTIVE: The aim of this study was to develop and validate a reliable and accurate machine learning-based predictive model for hip fracture incidence to improve the prediction of the risk of hip fracture in community residents. METHODS: Data were obtained from the China Health and Retirement Longitudinal Study (CHARLS), encompassing 21,095 individuals aged 45 years and older, of whom 616 reported hip fractures. Baseline data from these participants were utilized to examine 34 metrics, including demographic characteristics, lifestyle, health status, and mental health and cognitive functioning scores. Ten machine learning algorithms, including Random Forest (RF), Adaptive Boosting (AdaBoost), and Decision Tree (DT) machine learning techniques were used to analyze and determine the optimal model. The performance of the predictive models was evaluated by area under the receiver operating characteristic curve (AUC), sensitivity, specificity and F1 score. The SHapley Additive exPlanation (SHAP) interpretation was utilized in identifying the key influencing factors and individual heterogeneity was explained through instance level analysis. RESULTS: After addressing class imbalance using a class-weighting technique, we found that the Random Forest model performed the best, with an AUC of 0.93, with high sensitivity, specificity, and F1 score. Metabolic Equivalent of Task (MET), age, fall down, drinking, cognitive functioning scores, sleep duration of nap after lunch, residence, total sleep duration, and marital status were the key predictors. The model demonstrated favorable predictive performance in both the internal and external validation cohorts, indicating that the model is optimal. CONCLUSION: The machine learning-based predictive model developed in this study demonstrated strong predictive performance for incident hip fractures over a 7-year period. By incorporating readily available, modifiable lifestyle factors, the model serves as a promising tool for identifying individuals at high risk. It provides a scientific basis for developing early intervention strategies, but requires further prospective validation before clinical implementation.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。