Use of Deep-Learning Genomics to Discriminate Healthy Individuals from Those with Alzheimer's Disease or Mild Cognitive Impairment

利用深度学习基因组学区分健康个体与阿尔茨海默病或轻度认知障碍患者

阅读:1

Abstract

OBJECTIVES: Alzheimer's disease (AD) is the most prevalent neurodegenerative disorder and the most common form of dementia in the elderly. Certain genes have been identified as important clinical risk factors for AD, and technological advances in genomic research, such as genome-wide association studies (GWAS), allow for analysis of polymorphisms and have been widely applied to studies of AD. However, shortcomings of GWAS include sensitivity to sample size and hereditary deletions, which result in low classification and predictive accuracy. Therefore, this paper proposes a novel deep-learning genomics approach and applies it to multitasking classification of AD progression, with the goal of identifying novel genetic biomarkers overlooked by traditional GWAS analysis. METHODS: In this study, we selected genotype data from 1461 subjects enrolled in the Alzheimer's Disease Neuroimaging Initiative, including 622 AD, 473 mild cognitive impairment (MCI), and 366 healthy control (HC) subjects. The proposed deep-learning genomics (DLG) approach consists of three steps: quality control, coding of single-nucleotide polymorphisms, and classification. The ResNet framework was used for the DLG model, and the results were compared with classifications by simple convolutional neural network structure. All data were randomly assigned to one training/validation group and one test group at a ratio of 9 : 1. And fivefold cross-validation was used. RESULTS: We compared classification results from the DLG model to those from traditional GWAS analysis among the three groups. For the AD and HC groups, the accuracy, sensitivity, and specificity of classification were, respectively, 98.78 ± 1.50%, 98.39% ± 2.50%, and 99.44% ± 1.11% using the DLG model, while 71.38% ± 0.63%, 63.13% ± 2.87%, and 85.59% ± 6.66% using traditional GWAS. Similar results were obtained from the other two intergroup classifications. CONCLUSION: The DLG model can achieve higher accuracy and sensitivity when applied to progression of AD. More importantly, we discovered several novel genetic biomarkers of AD progression, including rs6311 and rs6313 in HTR2A, rs1354269 in NAV2, and rs690705 in RFC3. The roles of these novel loci in AD should be explored in future research.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。