Identifying proteomic prognostic markers for Alzheimer's disease with survival machine learning: The Framingham Heart Study

利用生存机器学习识别阿尔茨海默病蛋白质组学预后标志物:弗雷明汉心脏研究

阅读:2

Abstract

BACKGROUND: Protein abundance levels, sensitive to both physiological changes and external interventions, are useful for assessing the Alzheimer's disease (AD) risk and treatment efficacy. However, identifying proteomic prognostic markers for AD is challenging by their high dimensionality and inherent correlations. METHODS: Our study analyzed 1128 plasma proteins, measured by the SOMAscan platform, from 858 participants 55 years and older (mean age 63 years, 52.9 % women) of the Framingham Heart Study (FHS) Offspring cohort. We conducted regression analysis and machine learning models, including LASSO-based Cox proportional hazard regression model (LASSO) and generalized boosted regression model (GBM), to identify protein prognostic markers. These markers were used to construct a weighted proteomic composite score, the AD prediction performance of which was assessed using time-dependent area under the curve (AUC). The association between the composite score and memory domain was examined in 339 (of the 858) participants with available memory scores, and in a separate group of 430 participants younger than 55 years (mean age 46, 56.7 % women). RESULTS: Over a mean follow-up of 20 years, 132 (15.4 %) participants developed AD. After adjusting for baseline age, sex, education, and APOE ε4 + status, regression models identified 309 proteins (P ≤ 0.2). After applying machine learning methods, nine of these proteins were selected to develop a composite score. This score improved AD prediction beyond the factors of age, sex, education, and APOE ε4 + status across 15-25 years of follow-up, achieving its peak AUC of 0.84 in the LASSO model at the 22-year follow-up. It also showed a consistent negative association with memory scores in the 339 participants (beta = -0.061, P = 0.046), 430 participants (beta = -0.060, P = 0.018), and the pooled 769 samples (beta = -0.058, P = 0.003). CONCLUSION: These findings highlight the utility of machine learning method in identifying proteomic markers in improving AD prediction and emphasize the complex pathology of AD. The composite score may aid early AD detection and efficacy monitoring, warranting further validation in diverse populations.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。