cfDiffusion: diffusion-based efficient generation of high quality scRNA-seq data with classifier-free guidance.

阅读:4
作者:Zhang Tianjiao, Zhao Zhongqian, Ren Jixiang, Zhang Ziheng, Zhang Hongfei, Wang Guohua
Single-cell RNA sequencing (scRNA-seq) technology provides a powerful means to measure gene expression at the individual cell level, thereby uncovering the intricate cellular heterogeneity that underlies various biological processes, including embryonic development, tumor metastasis, and microbial reproduction. However, the variable amounts of data generated across different cell types within tissues can compromise the accuracy of downstream analyses. Traditional approaches for generating scRNA-seq simulation data often rely on predefined data distributions, which can negatively impact the quality of the simulated data. Furthermore, these methods typically focus on simulating single-attribute cells, necessitating substantial additional data for the simulation of multi-attribute cells, which can lead to increased training times. To address these limitations, we propose cfDiffusion, a novel method grounded in diffusion models that incorporates Classifier-Free Guidance and a high-level feature caching mechanism. By leveraging Classifier-Free Guidance, cfDiffusion significantly reduces the training costs associated with model development compared to traditional Classifier Guidance methods. The integration of a caching mechanism further enhances efficiency by shortening inference times. While the inference duration of cfDiffusion remains longer than that of scDiffusion, it exhibits superior expressiveness and efficiency in generating multi-attribute single-cell data. Evaluated across datasets from multiple sequencing platforms, cfDiffusion consistently outperforms state-of-the-art models across various performance metrics. Additionally, cfDiffusion enables the simulation of single-cell data along a pseudo-time scale, facilitating advanced analyses such as tracking cell differentiation, investigating intercellular communication, and elucidating cellular heterogeneity.

特别声明

1、本文转载旨在传播信息,不代表本网站观点,亦不对其内容的真实性承担责任。

2、其他媒体、网站或个人若从本网站转载使用,必须保留本网站注明的“来源”,并自行承担包括版权在内的相关法律责任。

3、如作者不希望本文被转载,或需洽谈转载稿费等事宜,请及时与本网站联系。

4、此外,如需投稿,也可通过邮箱info@biocloudy.com与我们取得联系。