Delineating Metabolic Dysfunction-Associated Steatotic Liver Disease and Comorbid Risk With Clinical, Blood and Metabolomic Parameters—A UK Biobank Analysis

利用临床、血液和代谢组学参数阐明代谢功能障碍相关脂肪肝疾病及其合并症风险——一项英国生物银行分析

阅读:1

Abstract

BACKGROUND/OBJECTIVES: Ascertaining risk of and prognostication tools for metabolic dysfunction-associated steatotic liver disease (MASLD) remain suboptimal. Using the UK Biobank, we investigated the role of multivariate supervised learning in predicting MASLD risk and the presence of disease subgroups in stratifying the risk of comorbid outcomes. METHODS: Two ground truth definitions for MASLD were derived using magnetic resonance imaging–proton density fat fraction (MRI-PDFF) and electronic health records (EHRs). A training and validation set was created (60:40 split), using multiple imputation for missing data (mean missingness: 2.05%). Variable importance analyses of partial least squares-discriminant analysis (PLS-DA) and random forest (RF) were used to select important clinical, genetic, haematological, biochemical, and metabolomic features of MASLD, which were subsequently used to train models to classify disease risk. The presence of MASLD subgroups were explored by K-means clustering and the relationship with comorbid disease outcomes were investigated. RESULTS: Three thousand seven hundred ninety-nine and 2552 individuals were available for analysis for the two ground truth definitions. A total of 49 features were selected to train our final models. Model performance based on area under receiver operating characteristics for PLS-DA, RF, support vector machine linear kernel, and logistic regression were all above ≥0.80 (MRI-PDFF definition) and ≥0.85 (MRI-PDFF and EHR definition) at predicting MASLD. Unsupervised clustering of the features in MASLD identified two risk subgroups with a ∼3-fold difference in ischaemic stroke, hypertension, and extrahepatic malignancy, a ∼2-fold difference in hyperlipidaemia and T2DM, and a >1.5-fold difference in ischaemic heart disease and all-cause mortality. CONCLUSIONS: Using multivariate supervised learning models, we have identified features of MASLD in UK Biobank participants that can be used to predict the risk of comorbid diseases and outcomes. This provides insights into MASLD pathophysiology and highlights the potential of learning models for case identification and prognostication. Further external validation is required to assess applicability of the predictive features and for model optimisation.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。