Medication-Stratified Analysis of LDL-C Equation Miscalibration in Diabetes: Evidence from the All of Us Research Program and a Medication-Agnostic Machine-Learning Correction



Authors: Doku, Ronald; Osafo, Nana Yaw; Kwagyan, John; Southerland, William M

Year:	2025	Issue/pages:	2025 Nov 17
DOI:	10.1101/2025.10.23.25338682	Research area:	Metabolism
Disease type:	Diabetes

Abstract

OBJECTIVE: Standard LDL-C equations were derived in cohorts largely untreated with modern combination diabetes therapies. In a cohort where 84% of patients were on statins, 53% on insulin, and 25% on GLP-1 receptor agonists, often in combination, we quantified medication-specific miscalibration in LDL-C equations and evaluated a machine learning correction that does not require medication data.

RESEARCH DESIGN AND METHODS: Using All of Us Research Program data (n=3,477; test n=696), we compared Friedewald, Martin-Hopkins, and Sampson (NIH) Equation 2 against direct LDL-C measurements. We developed a stacked ensemble model (elastic net, random forest, XGBoost, neural network) trained solely on routine laboratory values. Accuracy was assessed within medication groups that allowed for combination therapy: insulin users, GLP-1 users, and statin users. The primary endpoints were mean absolute error (MAE) with 95% bootstrap confidence intervals and calibration (ordinary least squares regression of true on predicted LDL-C). The secondary endpoint was the Net Reclassification Index at 100 mg/dL.

RESULTS: Among 696 test participants, 587 (84%) used statins, 366 (53%) insulin, and 175 (25%) GLP-1 agonists. Patients on triple therapy (insulin+GLP-1+statin) showed the most severe miscalibration: Friedewald slope 0.29, representing 71% compression of the prediction range. Among all GLP-1 users (77% also on insulin), standard equations severely underestimated LDL-C, with calibration slopes of 0.42-0.48 versus the ideal of 1.0. Specifically, Friedewald showed slope 0.42 (95% CI 0.27-0.56) with intercept +62 mg/dL; Sampson (NIH) Equation 2 showed slope 0.48 (0.32-0.64) with intercept +55 mg/dL; and Martin-Hopkins showed slope 0.47 (0.31-0.63) with intercept +55 mg/dL. The machine learning (ML) model maintained better calibration (slope 0.83 [0.56-1.09]; intercept -2.2 mg/dL) and reduced MAE by 17% versus Friedewald. Insulin users showed similar improvement: Friedewald slope 0.55 (0.45-0.65) versus 0.95 (0.78-1.12) for the ML model, with 16% lower error. The medication-by-triglyceride interaction was significant (p=0.002). In patients with insulin exposure and triglycerides ≥200 mg/dL, the Net Reclassification Index was 0.240 versus 0.022 overall, indicating greater misclassification risk in hypertriglyceridemia.

CONCLUSIONS: Standard LDL-C equations systematically underestimate true levels in medication-treated diabetes patients, with the largest errors in combination therapy. An ML model trained on routine laboratory values, without medication data, achieved near-ideal calibration (slopes 0.83-1.03) and reduced errors by 8-20% across medication groups. These observational findings suggest that direct LDL-C measurement or ML-assisted correction should be considered when equation estimates approach treatment thresholds, particularly for patients on combination therapy.
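
The endpoints named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code: the function names, the percentile bootstrap, and the toy lipid panels are assumptions made for illustration, and the toy values are invented, not study data. Only the Friedewald formula (LDL-C = TC - HDL-C - TG/5, in mg/dL) and the definition of calibration as ordinary least squares regression of true on predicted LDL-C come from the standard literature and the abstract.

```python
import random
from statistics import mean


def friedewald_ldl(tc, hdl, tg):
    """Friedewald estimate in mg/dL: LDL-C = TC - HDL-C - TG/5."""
    return tc - hdl - tg / 5.0


def mean_absolute_error(true, pred):
    """Mean absolute difference between direct and estimated LDL-C."""
    return mean(abs(t - p) for t, p in zip(true, pred))


def calibration(true, pred):
    """OLS regression of true on predicted LDL-C; returns (slope, intercept)."""
    mx, my = mean(pred), mean(true)
    sxx = sum((x - mx) ** 2 for x in pred)
    sxy = sum((x - mx) * (y - my) for x, y in zip(pred, true))
    slope = sxy / sxx
    return slope, my - slope * mx


def bootstrap_ci(true, pred, stat, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for stat(true, pred).

    Assumption: a simple percentile bootstrap over participants; the
    abstract does not say which bootstrap variant was used.
    """
    rng = random.Random(seed)
    n = len(true)
    draws = []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]
        draws.append(stat([true[i] for i in idx], [pred[i] for i in idx]))
    draws.sort()
    lo = draws[int((alpha / 2) * (n_boot - 1))]
    hi = draws[int((1 - alpha / 2) * (n_boot - 1))]
    return lo, hi


def range_compression(slope):
    """Fraction by which a calibration slope below 1 falls short of ideal."""
    return 1.0 - slope


if __name__ == "__main__":
    # Invented toy panels: (total cholesterol, HDL-C, triglycerides, direct LDL-C).
    panels = [
        (180, 45, 150, 112), (210, 40, 260, 131), (160, 55, 90, 88),
        (230, 38, 310, 147), (195, 50, 180, 115), (170, 42, 220, 101),
    ]
    direct = [p[3] for p in panels]
    estimated = [friedewald_ldl(tc, hdl, tg) for tc, hdl, tg, _ in panels]

    print("MAE:", mean_absolute_error(direct, estimated))
    print("95% CI:", bootstrap_ci(direct, estimated, mean_absolute_error))
    print("slope, intercept:", calibration(direct, estimated))
    print("compression at slope 0.29:", range_compression(0.29))
```

A slope of 1 with an intercept of 0 means the estimates track direct LDL-C exactly. The reported Friedewald slope of 0.29 in triple-therapy patients corresponds to 1 - 0.29 = 0.71, the 71% figure quoted in the results.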
