Abstract
Gene regulatory networks (GRN) govern cellular identity and function through precise control of gene transcription. Single-cell technologies have provided powerful means to dissect regulatory mechanisms within specific cellular states. However, existing computational approaches for modeling single-cell RNA sequencing (scRNA-seq) data often infer local regulatory interactions independently, which limits their ability to resolve regulatory mechanisms from a global perspective. Here, we propose a deep learning framework (Planet) based on diffusion models for constructing cell-specific GRN, thereby providing a systems-level view of how protein regulators orchestrate transcriptional programs. Planet jointly optimizes local network structures in conjunction with gene expression profiles, thereby enhancing the structural consistency of the resulting networks at the global level. Specifically, Planet decomposes GRN generation into a series of Markovian evolution steps and introduces a Triple Hybrid-Attention Transformer to capture long-range regulatory dependencies across diffusion time-steps. Benchmarks on multiple scRNA-seq datasets demonstrate that Planet achieves competitive performance against state-of-the-art methods and yields only a slight improvement over DigNet under comparable conditions. Compared with conventional diffusion models that rely on fixed sampling schedules, Planet employs a fast-sampling strategy that accelerates inference with only minimal accuracy trade-off. When applied to mouse-lung Cd8(+)Gzmk(+) T cells, Planet successfully reconstructs a cell-type-specific GRN, recovers both established and previously uncharacterized regulators, and delineates the dynamic immunoregulatory changes that accompany ageing. Overall, Planet provides a practical framework for constructing cell-specific GRNs with improved global consistency, offering a complementary perspective to existing methods and new insights into regulatory dynamics in health and disease.