DeepBindRG: a deep learning based method for estimating effective protein-ligand affinity

DeepBindRG:一种基于深度学习的蛋白质-配体有效亲和力评估方法

阅读:1

Abstract

Proteins interact with small molecules to modulate several important cellular functions. Many acute diseases were cured by small molecule binding in the active site of protein either by inhibition or activation. Currently, there are several docking programs to estimate the binding position and the binding orientation of protein-ligand complex. Many scoring functions were developed to estimate the binding strength and predict the effective protein-ligand binding. While the accuracy of current scoring function is limited by several aspects, the solvent effect, entropy effect, and multibody effect are largely ignored in traditional machine learning methods. In this paper, we proposed a new deep neural network-based model named DeepBindRG to predict the binding affinity of protein-ligand complex, which learns all the effects, binding mode, and specificity implicitly by learning protein-ligand interface contact information from a large protein-ligand dataset. During the initial data processing step, the critical interface information was preserved to make sure the input is suitable for the proposed deep learning model. While validating our model on three independent datasets, DeepBindRG achieves root mean squared error (RMSE) value of pKa (-logK(d) or -logK(i)) about 1.6-1.8 and R value around 0.5-0.6, which is better than the autodock vina whose RMSE value is about 2.2-2.4 and R value is 0.42-0.57. We also explored the detailed reasons for the performance of DeepBindRG, especially for several failed cases by vina. Furthermore, DeepBindRG performed better for four challenging datasets from DUD.E database with no experimental protein-ligand complexes. The better performance of DeepBindRG than autodock vina in predicting protein-ligand binding affinity indicates that deep learning approach can greatly help with the drug discovery process. We also compare the performance of DeepBindRG with a 4D based deep learning method "pafnucy", the advantage and limitation of both methods have provided clues for improving the deep learning based protein-ligand prediction model in the future.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。