Mining and expanding high-quality genetic parts for synthetic biology and bioengineering are urgent needs in the research and development of next-generation biotechnology. However, gene mining has relied on sequence homology or ample expert knowledge, which fundamentally limits the establishment of a comprehensive genetic part catalog. In this work, we propose SYMPLEX (synthetic biological part mining platform by large language model-enabled knowledge extraction), a universal gene-mining platform based on large language models. We applied SYMPLEX to mine enzymes responsible for messenger RNA (mRNA) capping, a key process in eukaryotic posttranscriptional modification, and obtained thousands of diverse candidates with traceable evidence from biomedical literature and databases. Of the 46 experimentally tested integral capping enzyme candidates, 14 demonstrated in vivo cross-species capping activity, and 2 displayed superior in vitro activity over the commercial vaccinia capping enzymes currently used in mRNA vaccine production. SYMPLEX provides a distinct paradigm for functional gene mining and offers powerful tools to facilitate knowledge discovery in fundamental research.
Discovery of diverse and high-quality mRNA capping enzymes through a language model-enabled platform.
Authors: Wang Tianze, Qin Bowen R, Li Sihong, Wang Zimo, Li Xuejian, Jiang Yuanxu, Qin Chenrui, Ouyang Qi, Lou Chunbo, Qian Long
| Journal: | Science Advances | Impact factor: | 12.500 |
| Year: | 2025 | Citation: | 2025 Apr 11; 11(15):eadt0402 |
| doi: | 10.1126/sciadv.adt0402 | Research area: | Other |
