Non-Glycemic Clinical Data for Type 2 Diabetes Detection in Mexican Adults: A Comparative Analysis of Atherogenic Indices, Statistical Transformations, and Machine Learning Algorithms

墨西哥成年人2型糖尿病检测的非血糖临床数据:动脉粥样硬化指数、统计转换和机器学习算法的比较分析

阅读:1

Abstract

Background: Type 2 diabetes (T2D) is a growing public health problem in Mexico. Lipid profile alterations have been shown to appear years before changes in glycemic biomarkers, and some of the latter are limited in availability, especially in underserved settings. Therefore, anthropometric variables and lipids represent relevant early indicators for the early detection of the disease. This study evaluates the capacity of non-glycemic clinical data-including lipid profile and anthropometric indicators-to detect T2D using machine learning, and compares the performance of different feature engineering approaches. Methods: Using more than a thousand clinical records of Mexican adults, three experiments were developed: (1) a distribution and normality analysis to characterize the variability of lipid variables; (2) an evaluation of the predictive power of multiple atherogenic indices (Castelli I, Castelli II, TG/HDL, and AIP); and (3) the implementation of statistical transformations (logarithmic, quare-root, and Z-standardization) to stabilize variance and improve feature quality. Logistic regression, SVM-RBF, random forest, and XGBoost models were trained on each feature set and evaluated using accuracy, sensitivity, specificity, F1-score, and area under the ROC curve. Results: The AIP index showed the greatest discriminatory power among the atherogenic indices, while normality-based transformations improved the performance of distribution-sensitive models, such as SVM. In the final experiment, the SVM-RBF and XGBoost models achieved AUC values greater than 0.90, demonstrating the feasibility of a diagnostic approach based exclusively on non-glycemic data. Conclusions: The findings indicate that the transformed lipid profile and anthropometric variables can constitute a solid and accessible alternative for the early detection of T2D in clinical and public health contexts, offering a robust methodological framework for future predictive applications in the absence of traditional glycemic biomarkers.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。