Structure-based prediction of SARS-CoV-2 variant properties using machine learning on mutational neighborhoods

基于突变邻域的机器学习对SARS-CoV-2变异株特性进行结构预测

阅读:2

Abstract

This dataset presents a structure-enriched resource of theoretical and empirical SARS-CoV-2 spike receptor-binding domain (RBD) variants, developed under the STAYAHEAD project for pandemic preparedness. It integrates large-scale in silico structure predictions with empirical biophysical measurements. The dataset includes 3,705 single-point Wuhan-Hu-1 RBD variants and 100 higher-order Omicron BA.1/BA.2 variants, annotated with AlphaFold2 and ESMFold metrics and Bio2Byte sequence-based predictors. Structural descriptors-RMSD, TM-score, plDDT, solvent accessibility, hydrophobicity, aggregation propensity-are linked to ACE2 binding and expression data from deep mutational scanning. Provided as a FAIR(2) Data Package, it supports structure-function analysis, variant modeling, and responsible reuse in virology, structural biology, and computational protein science. This collaboration was co-funded by the PPP Allowance from Health ∼ Holland, Top Sector Life Sciences and Health, to stimulate public-private partnerships.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。