Multi-modal transcriptomics: integrating machine learning and convolutional neural networks to identify immune biomarkers in atherosclerosis

多模态转录组学:整合机器学习和卷积神经网络以识别动脉粥样硬化中的免疫生物标志物

阅读:2

Abstract

BACKGROUND: Atherosclerosis, a complex chronic vascular disorder with multifactorial etiology, stands as the primary culprit behind consequential cardiovascular events, imposing a substantial societal and economic burden. Nevertheless, our current understanding of its pathogenesis remains imprecise. In this investigation, our objective is to establish computational models elucidating molecular-level markers associated with atherosclerosis. This endeavor involves the integration of advanced machine learning techniques and comprehensive bioinformatics analyses. MATERIALS AND METHODS: Our analysis incorporated data from three publicly available the Gene Expression Omnibus (GEO) datasets: GSE100927 (104 samples, 30,558 genes), which includes atherosclerotic lesions and control arteries from carotid, femoral, and infra-popliteal arteries of deceased organ donors; GSE43292 (64 samples, 23,307 genes), consisting of paired carotid endarterectomy samples from 32 hypertensive patients, comparing atheroma plaques and intact tissues; and GSE159677 (30,498 single cells, 33,538 genes), examining single-cell transcriptomes of calcified atherosclerotic core plaques and adjacent carotid artery tissues from patients undergoing carotid endarterectomy. Utilizing single-cell sequencing, highly variable atherosclerotic monocyte subpopulations were systematically identified. We analyzed cellular communication patterns with temporal dynamics. The bioinformatics approach Weighted Gene Co-expression Network Analysis (WGCNA) identified key modules, constructing a Protein-Protein Interaction (PPI) network from module-associated genes. Three machine-learning models derived marker genes, formulated through logistic regression and validated via convolutional neural network(CNN) modeling. Subtypes were clustered based on Gene Set Variation Analysis (GSVA) scores, validated through immunoassays. RESULTS: Three pivotal atherosclerosis-associated genes-CD36, S100A10, CSNK1A1-were unveiled, offering valuable clinical insights. Profiling based on these genes delineated two distinct isoforms: C2 demonstrated potent microbicidal activity, while C1 engaged in inflammation regulation, tissue repair, and immune homeostasis. Molecular docking analyses explored therapeutic potential for Estradiol, Zidovudine, Indinavir, and Dronabinol for clinical applications. CONCLUSION: This study introduces three signature genes for atherosclerosis, shaping a novel paradigm for investigating clinical immunological medications. It distinguishes the high biocidal C2 subtype from the inflammation-modulating C1 subtype, utilizing identified signature gene as crucial targets.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。