Structure prediction for proteins lacking homologous templates in the Protein Data Bank (PDB) remains a significant unsolved problem. We developed a protocol, C-I-TASSER, to integrate interresidue contact maps from deep neural-network learning with the cutting-edge I-TASSER fragment assembly simulations. Large-scale benchmark tests showed that C-I-TASSER can fold more than twice the number of non-homologous proteins than the I-TASSER, which does not use contacts. When applied to a folding experiment on 8,266 unsolved Pfam families, C-I-TASSER successfully folded 4,162 domain families, including 504 folds that are not found in the PDB. Furthermore, it created correct folds for 85% of proteins in the SARS-CoV-2 genome, despite the quick mutation rate of the virus and sparse sequence profiles. The results demonstrated the critical importance of coupling whole-genome and metagenome-based evolutionary information with optimal structure assembly simulations for solving the problem of non-homologous protein structure prediction.
Folding non-homologous proteins by coupling deep-learning contact maps with I-TASSER assembly simulations.
通过将深度学习接触图与 I-TASSER 组装模拟相结合来折叠非同源蛋白质
阅读:5
作者:Zheng Wei, Zhang Chengxin, Li Yang, Pearce Robin, Bell Eric W, Zhang Yang
| 期刊: | Cell Reports Methods | 影响因子: | 4.500 |
| 时间: | 2021 | 起止号: | 2021 Jul 26; 1(3):100014 |
| doi: | 10.1016/j.crmeth.2021.100014 | 研究方向: | 免疫/内分泌 |
特别声明
1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。
2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。
3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。
4、投稿及合作请联系:info@biocloudy.com。
