Transcriptome-conditioned molecule generation via gene interaction-aware fragment modeling with a GPT-based architecture

基于GPT架构的基因相互作用感知片段建模,实现转录组调控的分子生成

阅读:1

Abstract

Phenotype-driven drug discovery leverages cellular responses to guide the design of therapeutic molecules. Recent advancements in transcriptomics have provided extensive datasets describing how gene expression changes in response to various chemical stimuli, presenting an opportunity to directly link molecular generation to specific cellular phenotypes. However, effectively linking transcriptomic perturbations to chemical structure generation remains challenging due to the complexity of gene interactions and chemical feasibility constraints. We developed GGIFragGPT, a novel generative model that integrates transcriptomic perturbation profiles with biologically informed gene-gene interaction embeddings to guide fragment-based molecular generation. The model employs an autoregressive transformer architecture to sequentially assemble chemically valid fragments, with cross-attention mechanisms highlighting biologically relevant genes guiding the molecular generation process. Comparative analysis confirmed that the proposed approach yields chemically feasible, novel, and diverse molecules. By leveraging transcriptomic profiles, GGIFragGPT successfully generated compounds aligned with the biological context suggested by transcriptomic data, validated through gene-level interpretability analysis that identified key target genes. Case studies demonstrated the model's capability to produce structurally plausible inhibitors, exemplified by targeted molecule generation against CDK7. This work demonstrates the potential of integrating biological insights into chemical generation processes, offering a promising approach for phenotype-driven therapeutic discovery.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。