Abstract
Transcriptome profiling is a cornerstone of functional genomics, enabling the detailed characterization of gene expression in health and disease. Bulk RNA sequencing (bulk RNAseq) remains the most widely used approach in clinical and large-cohort studies due to its cost-effectiveness, robustness, and comprehensive transcriptome coverage. However, bulk RNAseq inherently averages gene expression signals across heterogeneous cell populations, thereby masking cellular diversity and obscuring rare cell types. In contrast, single-cell RNA sequencing (scRNAseq) enables a high-resolution analysis of cellular heterogeneity, allowing the identification of distinct cell types, transitional states, and developmental trajectories. Nevertheless, scRNAseq is associated with higher cost, limited scalability, increased technical noise, sparse expression matrices, and protocol-dependent biases introduced during tissue dissociation or nuclear isolation. In this review, we summarize the conceptual and methodological foundations of integrating bulk RNAseq and scRNAseq data, emphasizing their complementary strengths and limitations. We discuss how scRNAseq-derived cell-type atlases can serve as reference matrices for computational reconstruction (deconvolution) of bulk RNAseq profiles and examine key sources of technical and biological variability. Furthermore, we outline major integration strategies, including reference-based deconvolution, pseudobulk aggregation, and Bayesian joint modeling to provide an overview of widely used analytical tools and essential components of scRNAseq data processing workflows.