Abstract
Cell-to-cell communication (CCC) facilitates the coordination of various cellular behaviors in multicellular organisms. Many computational methods neglect downstream intracellular signaling and are limited by static and predefined ligand-receptor (L-R) databases. To address these limitations, we present CELLetter, a deep learning framework to identify potential L-R interactions through a novel feature learning model and decipher cellular signaling by integrating L-R co-expression with downstream transcription factor (TF) activity inferred from gene regulatory network. CELLetter begins by leveraging the protein large language model, ProstT5, for feature embedding. It then employs a dual-stream architecture for feature extraction and dimensionality reduction, a gate mechanism with dynamic weight adjustment for feature fusion, absolute difference, and element-wise product for feature interaction. After that, CELLetter combines interacting L-R pairs, single-cell RNA sequencing (scRNA-seq) data, and downstream TF activity to quantify communication strength. We comprehensively evaluated CELLetter using 11 evaluation metrics, benchmarking it against 4 state-of-the-art L-R classification models, 6 L-R validation tools, 10 CCC inference methods. CELLetter demonstrated superior L-R classification performance. Notably, we introduced a novel multi-faceted validation strategy employing colocalization distance, co-expression ratio, and co-detection probability on spatial transcriptomics data from human heart and distal lung epithelial tissues. CELLetter's predicted L-R pairs exhibited significant spatial relevance compared with other baselines. When applied to human head and neck squamous cell carcinoma (HNSCC) data, CELLetter produced CCC inferences broadly consistent with established methods. More importantly, ligand macrophage migration inhibitory factor (MIF) and receptor CD44 were predicted as a central signaling axis within HNSCC tumor microenvironment, suggesting their potentials as therapeutic targets . CELLetter is freely available at https://github.com/plhhnu/CELLetter.