CelLink: integrating single-cell multi-omics data with weak feature linkage and imbalanced cell populations

CelLink:整合具有弱特征关联性和不平衡细胞群体的单细胞多组学数据

阅读:1

Abstract

Single-cell multi-omics technologies capture complementary molecular layers, enabling a comprehensive view of cellular states and functions. However, integrating these data types poses significant challenges when their features are weakly linked and cell population sizes are imbalanced. Currently, no method efficiently addresses these two issues simultaneously. Therefore, we developed CelLink, a novel single-cell multi-omics data integration method designed to overcome these challenges. CelLink normalizes and smooths feature profiles to align scales across datasets and integrates them through a multi-phase pipeline that iteratively employs the optimal transport algorithm. It dynamically refines cell-cell correspondences, identifying and excluding cells that cannot be reliably matched, thus avoiding performance degradation caused by erroneous imputations. This approach effectively adapts to weak feature linkage and imbalanced cell populations between datasets. Benchmarking CelLink on scRNA-seq and spatial proteomics datasets, as well as paired CITE-seq data, demonstrates its superior performance across various evaluation metrics, including data mixing, cell manifold structure preservation, and feature imputation accuracy. Compared to state-of-the-art methods, CelLink significantly outperforms others in imbalanced cell populations while consistently achieving better performance for balanced datasets. Moreover, CelLink uniquely enables cell subtype annotation, correction of mislabeled cells, and spatial transcriptomic analyses by imputing transcriptomic profiles for spatial proteomics data. Its great ability to impute large-scale paired single-cell multi-omics data positions it pivotal for building single-cell multi-modal foundation models and spatial cellular biology.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。