Multiscale conformational sampling of multidomain fusion proteins by a physics informed diffusion model



Abstract

Multidomain fusion proteins, such as bispecific antibodies, rely on highly flexible linker regions for their therapeutic efficacy. Characterizing their vast conformational ensembles is crucial for rational drug design. All-atom molecular dynamics (MD) is the traditional gold standard, but its computational cost makes simulating large-scale domain motions prohibitive. Recently, deep generative diffusion models have emerged as a rapid alternative for sampling protein dynamics. However, because these generic models are trained primarily on large databases of structured, static domains, they often lack the biophysical constraints required to thoroughly sample the large-scale dynamics of highly flexible multidomain architectures. To overcome this, we use microsecond MD trajectories of a multidomain protein construct with various linkers to train a multiscale diffusion framework built on an Equivariant Graph Neural Network (EGNN). To model the dynamics of such large molecular complexes efficiently, we employ a coarse-grained spatial graph that condenses rigid domains into center-of-mass anchors while preserving explicit backbone resolution for the flexible linker. By further integrating foundational biophysical rules directly into both the training objective and the inference process, our model generates high-fidelity conformational ensembles that reproduce the thermodynamic distributions of long-timescale MD. This physics-informed approach provides a mathematically stable, highly scalable platform for the rapid multiscale characterization of flexible biologics, significantly accelerating the rational design of fusion protein therapeutics.
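The coarse-graining step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `coarse_grain` and `radius_edges`, the slice-based domain and linker definitions, and the distance-cutoff rule for edges are all assumptions made for illustration. The sketch only shows the general idea of replacing each rigid domain with one center-of-mass node while keeping linker atoms as individual nodes.

```python
import numpy as np

def coarse_grain(coords, masses, domain_slices, linker_slice):
    """Build node coordinates for a coarse-grained graph (illustrative only).

    Each rigid domain is condensed to a single center-of-mass anchor;
    linker atoms are kept at explicit resolution. Anchors come first,
    followed by the linker atoms in their original order.
    """
    anchors = []
    for sl in domain_slices:
        m = masses[sl]
        # Mass-weighted mean position of the atoms in this domain.
        anchors.append((coords[sl] * m[:, None]).sum(axis=0) / m.sum())
    return np.vstack([np.array(anchors), coords[linker_slice]])

def radius_edges(nodes, cutoff):
    """Directed edge list (2, E) linking distinct nodes closer than cutoff.

    The cutoff rule is an assumption; the source does not state how
    edges of the spatial graph are defined.
    """
    dist = np.linalg.norm(nodes[:, None, :] - nodes[None, :, :], axis=-1)
    mask = dist < cutoff
    np.fill_diagonal(mask, False)  # no self-loops
    src, dst = np.where(mask)
    return np.stack([src, dst])

# Toy system (hypothetical): domain A (4 atoms), linker (3 atoms), domain B (4 atoms).
rng = np.random.default_rng(0)
coords = rng.normal(size=(11, 3))
masses = np.ones(11)
nodes = coarse_grain(coords, masses, [slice(0, 4), slice(7, 11)], slice(4, 7))
edges = radius_edges(nodes, cutoff=5.0)
```

In this toy case, 11 atoms reduce to 5 graph nodes (2 anchors plus 3 linker atoms), which is the kind of size reduction that makes large complexes tractable for a graph neural network.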
