Machine learning-based prediction of NAFLD in patients with type 2 diabetes using routine clinical and biochemical indicators

利用常规临床和生化指标,通过机器学习预测2型糖尿病患者的非酒精性脂肪性肝病

阅读:1

Abstract

OBJECTIVES: This study aimed to develop and evaluate machine learning (ML) models for predicting non-alcoholic fatty liver disease (NAFLD) in patients with type 2 diabetes mellitus (T2DM) using readily accessible clinical and biochemical indicators. METHODS: A total of 2,459 patients with T2DM were enrolled in this cross-sectional study. Eight ML algorithms, logistic regression (LG), k-nearest neighbors (k-NN), support vector machine (SVM), decision tree (DT), random forest (RF), gradient boosting machine (GBM), extreme gradient boosting (XGBoost), and naïve Bayes (NB), were developed to construct predictive models. Feature selection was performed using Boruta, recursive feature elimination, and LASSO regression. Model performance was assessed using several metrics, including the area under the receiver operating characteristic curve (AUC), accuracy, recall, F1 score, and decision curve analysis. RESULTS: Among the study population, 1,309 individuals (53.23%) were diagnosed with NAFLD. Sixteen variables, including BMI, waist circumference, systolic blood pressure, triglycerides, HDL-C, ALT, GGT, bilirubin fractions, albumin, BUN, GFR, fasting insulin, RBC, and hemoglobin, were selected as key predictors. The SVM model demonstrated the best overall performance, achieving an AUC of 0.920, accuracy of 0.839, and specificity of 0.898 in the training set, and an AUC of 0.833 and accuracy of 0.733 in the validation set. Decision curve analysis confirmed superior clinical utility of the SVM model compared with other algorithms. CONCLUSIONS: ML-based models, particularly the SVM algorithm, effectively predicted NAFLD among patients with T2DM using easily accessible clinical and biochemical indicators. These findings highlight the potential utility of ML-assisted screening tools for improving early identification and risk stratification of NAFLD in diabetic populations.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。