Machine learning models integrating dietary data predict all-cause mortality in U.S. NAFLD patients: an NHANES-based study

整合膳食数据的机器学习模型预测美国非酒精性脂肪性肝病患者的全因死亡率:一项基于NHANES的研究

阅读:2

Abstract

BACKGROUND: Non-alcoholic fatty liver disease (NAFLD) is a leading cause of chronic liver disease, closely associated with metabolic abnormalities and unhealthy lifestyle habits. Despite the critical role of diet in disease progression, most existing prognostic models for NAFLD fail to incorporate dietary factors. This study aims to integrate demographic, serological, and nutritional data. It focuses on developing machine learning models that predict all-cause mortality risk in NAFLD patients, with a particular emphasis on dietary interventions. METHODS: Data from the National Health and Nutrition Examination Survey (NHANES) 2007-2018, comprising 2,589 NAFLD participants, were analyzed. Variables associated with survival outcomes were selected using LASSO-Cox regression. Five machine learning models-Random Survival Forest (RSF), Gradient Boosting Machine (GBM), CoxBoost, and Survival Support Vector Machine (SurvivalSVM), eXtreme Gradient Boosting (XGBoost) -were developed and their performance evaluated through time-dependent AUC, ROC curves, C-index, Brier score and Kaplan-Meier analysis. SHAP values were employed for model interpretability. RESULTS: LASSO-Cox regression identified 13 significant variables, including age, household income, blood glucose, sedentary behavior, dietary fiber intake and so on. The GBM and RSF models demonstrated strong predictive performance with AUC values around 0.8 for both 5- and 10-year survival predictions, and also performed well in terms of C-index and Brier score. SHAP analysis revealed that advanced age, low household income, hyperglycemia, and sedentary behavior were associated with poor prognosis, whereas higher dietary fiber intake was linked to improved survival. CONCLUSIONS: This study integrates dietary data into machine learning models, demonstrating the potential for predicting all-cause mortality in NAFLD patients. The models, particularly RSF and GBM, show robust predictive accuracy, with dietary fiber intake consistently exhibiting a protective effect on survival outcomes. These findings suggest that dietary interventions, such as increasing dietary fiber intake, could improve the long-term prognosis of NAFLD patients. CLINICAL TRIAL NUMBER: Not applicable.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。