Multi-omics single-cell data alignment and integration with enhanced contrastive learning and differential attention mechanism

利用增强型对比学习和差异注意力机制进行多组学单细胞数据比对和整合

阅读：1

作者：Zhang,Tianjiao,Zhao,Zhongqian,Zhang,Hongfei,Wu,Zhenao,Wang,Fang,Wang,Guohua

期刊：		影响因子：
时间：	2025	起止号：	2025 Aug 2;41(8)
doi：	10.1093/bioinformatics/btaf443	研究方向：	细胞生物学

Abstract

MOTIVATION: Identifying cell types that constitute complex tissue components using single-cell sequencing data is a critical issue in the field of biology. With the continuous advancement of sequencing technologies, the recognition of cell types has evolved from analyzing single-omics scRNA-seq data to integrating multi-omics single-cell data. However, existing methods for integrative analysis of high-dimensional multi-omics single-cell sequencing data have several limitations, including reliance on specific distribution assumptions of the data, sensitivity to noise, and clustering accuracy constrained by independent clustering methods. These issues have restricted improvements in the accuracy of cell type identification and hindered the application of such methods to large-scale datasets for cell type recognition. To address these challenges, we propose a novel method for aligning and integrating single-cell multi-omics data-scECDA. RESULTS: The scECDA employs independently designed autoencoders that can autonomously learn the feature distributions of each omics dataset. By incorporating enhanced contrastive learning and differential attention mechanisms, the scECDA effectively reduces the interference of noise during data integration. The model design exhibits high flexibility, enabling adaptation to single-cell omics data generated by different technological platforms. It directly outputs integrated latent features and end-to-end cell clustering results. Through the analysis of the distribution of latent features, the scECDA can effectively identify key biological markers and precisely distinguish cell subtypes, recover cluster-specific motif and infer trajectory. The scECDA was applied to eight paired single-cell multi-omics datasets, covering data generated by 10X Multiome, CITE-seq, and TEA-seq technologies. Compared to eight state-of-the-art methods, scECDA demonstrated higher accuracy in cell clustering. AVAILABILITY AND IMPLEMENTATION: The scECDA code is freely available at https://github.com/SuperheroBetter/scECDA.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用；引用内容仅为补充信息，不代表本站立场。

2、若认为本页面引用内容涉及侵权，请及时与本站联系，我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容，需注明“来源：[生知库]”并获得授权；使用引用内容的，需自行联系原作者获得许可。

4、投稿及合作请联系：info@biocloudy.com。