GeneCompass: deciphering universal gene regulatory mechanisms with a knowledge-informed cross-species foundation model

Authors: Xiaodong Yang#, Guole Liu#, Guihai Feng#, Dechao Bu#, Pengfei Wang#, Jie Jiang#, Shubai Chen#, Qinmeng Yang#, Hefan Miao, Yiyang Zhang, Zhenpeng Man, Zhongming Liang, Zichen Wang, Yaning Li, Zheng Li, Yana Liu, Yao Tian, Wenhao Liu, Cong Li, Ao Li, Jingxi Dong, Zhilong Hu, Chen Fang, Lina Cui, Zixu Deng, Haiping Jiang, Wentao Cui, Jiahao Zhang, Zhaohui Yang, Handong Li, Xingjian He, Liqun Zhong, Jiaheng Zhou, Zijian Wang, Qingqing Long, Ping Xu, Zhen Meng, Xuezhi Wang, Yangang Wang, Yong Wang, Shihua Zhang, Jingtao Guo, Yi Zhao, Yuanchun Zhou, Fei Li, Jing Liu, Yiqiang Chen, Ge Yang, Xin Li

Abstract

Deciphering universal gene regulatory mechanisms in diverse organisms holds great potential for advancing our knowledge of fundamental life processes and facilitating clinical applications. However, the traditional research paradigm primarily focuses on individual model organisms and does not integrate various cell types across species. Recent breakthroughs in single-cell sequencing and deep learning techniques present an unprecedented opportunity to address this challenge. In this study, we built an extensive dataset of over 120 million human and mouse single-cell transcriptomes. After data preprocessing, we obtained 101,768,420 single-cell transcriptomes and developed a knowledge-informed cross-species foundation model, named GeneCompass. During pre-training, GeneCompass effectively integrated four types of prior biological knowledge to enhance our understanding of gene regulatory mechanisms in a self-supervised manner. By fine-tuning for multiple downstream tasks, GeneCompass outperformed state-of-the-art models in diverse applications for a single species and unlocked new realms of cross-species biological investigations. We also employed GeneCompass to search for key factors associated with cell fate transition and showed that the predicted candidate genes could successfully induce the differentiation of human embryonic stem cells into the gonadal fate. Overall, GeneCompass demonstrates the advantages of using artificial intelligence technology to decipher universal gene regulatory mechanisms and shows tremendous potential for accelerating the discovery of critical cell fate regulators and candidate drug targets.
