Leveraging single-cell foundation models for accurate survival outcome prediction

利用单细胞基础模型进行准确的生存结果预测

阅读:1

Abstract

MOTIVATION: Foundation models trained on large-scale single-cell transcriptomes can capture rich molecular representations of cellular states, yet their potential for cancer survival prediction from bulk RNA-seq data remains largely unexplored. RESULTS: We applied the single-cell foundation model scFoundation to derive patient-level embeddings across 25 cancer types from TCGA and systematically evaluated their prognostic value under both cancer-specific and pan-cancer settings. To leverage complementary information, we developed an Embedding-Gene-Survival Prediction (EGSP) model that integrates foundation model embeddings with gene expression and clinical variables. EGSP achieved a mean concordance index (C-index) of 0.724 across cancers and exceeded 0.8 in seven cancer types, consistently outperforming single-modality models and existing multi-omics survival approaches. Comparative analyses showed that embeddings derived from pretrained scFoundation weights exhibited lower redundancy with gene expression while retaining complementary prognostic signals relative to pan-cancer fine-tuned embeddings. Explainable AI analyses further revealed that prognostic embeddings capture interpretable biological programs related to tumor differentiation, immune activity, and tumor-intrinsic growth, enabling transparent survival prediction at both cohort and patient levels. Overall, single-cell foundation model embeddings provide biologically meaningful and partially non-redundant survival signals that substantially improve bulk RNA-seq-based prognostic modeling. AVAILABILITY AND IMPLEMENTATION: https://github.com/weiliu123/EGSP.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。