Tlalpan 2020 Case Study: Enhancing Uric Acid Level Prediction with Machine Learning Regression and Cross-Feature Selection

Tlalpan 2020 案例研究:利用机器学习回归和交叉特征选择提高尿酸水平预测精度

阅读:2

Abstract

Background/Objectives: Uric acid is a key metabolic byproduct of purine degradation and plays a dual role in human health. At physiological levels, it acts as an antioxidant, protecting against oxidative stress. However, excessive uric acid can lead to hyperuricemia, contributing to conditions like gout, kidney stones, and cardiovascular diseases. Emerging evidence also links elevated uric acid levels with metabolic disorders, including hypertension and insulin resistance. Understanding its regulation is crucial for preventing associated health complications. Methods: This study, part of the Tlalpan 2020 project, aimed to predict uric acid levels using advanced machine learning algorithms. The dataset included clinical, anthropometric, lifestyle, and nutritional characteristics from a cohort in Mexico City. We applied Boosted Decision Trees (Boosted DTR), eXtreme Gradient Boosting (XGBoost), Categorical Boosting (CatBoost), and Shapley Additive Explanations (SHAP) to identify the most relevant variables associated with hyperuricemia. Feature engineering techniques improved model performance, evaluated using Mean Squared Error (MSE), Root-Mean-Square Error (RMSE), and the coefficient of determination (R(2)). Results: Our study showed that XGBoost had the highest accuracy for anthropometric and clinical predictors, while CatBoost was the most effective at identifying nutritional risk factors. Distinct predictive profiles were observed between men and women. In men, uric acid levels were primarily influenced by renal function markers, lipid profiles, and hereditary predisposition to hyperuricemia, particularly paternal gout and diabetes. Diets rich in processed meats, high-fructose foods, and sugary drinks showed stronger associations with elevated uric acid levels. In women, metabolic and cardiovascular markers, family history of metabolic disorders, and lifestyle factors such as passive smoking and sleep quality were the main contributors. Additionally, while carbohydrate intake was more strongly associated with uric acid levels in women, fructose and sugary beverages had a greater impact in men. To enhance model robustness, a cross-feature selection approach was applied, integrating top features from multiple models, which further improved predictive accuracy, particularly in gender-specific analyses. Conclusions: These findings provide insights into the metabolic, nutritional characteristics, and lifestyle determinants of uric acid levels, supporting targeted public health strategies for hyperuricemia prevention.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。