Development and Validation of a Machine Learning Model for Identifying Novel HIV Integrase Inhibitors

开发和验证用于识别新型HIV整合酶抑制剂的机器学习模型

阅读:3

Abstract

Human immunodeficiency virus (HIV) integrase inhibitors play a critical role in antiretroviral therapy, but the emergence of drug resistance necessitates the discovery of novel compounds. Machine learning (ML) offers a data-driven approach to accelerate drug discovery by predicting potential inhibitors with high efficacy. This study utilized a curated dataset of known HIV integrase inhibitors and employed feature engineering techniques to extract molecular descriptors. Random forest and logistic regression models were trained to classify compounds based on their inhibitory potential. Model performance was evaluated using accuracy, precision, recall, and the area under the receiver operating characteristic curve (AUC-ROC). The random forest model demonstrated superior predictive performance, achieving an AUC-ROC of 0.886, an accuracy of 0.815, and a precision of 0.79. Key molecular features, including hydrogen bond donors, rotatable bonds, and molecular weight, were identified as crucial determinants of inhibition. The models successfully screened novel compounds with high predicted inhibitory potential. Machine learning (ML) provides a powerful tool for the rapid identification of potential HIV integrase inhibitors. This study highlights the importance of molecular descriptors in predicting inhibitory activity and demonstrates the feasibility of ML-driven drug discovery. Future work will focus on refining model generalization, expanding datasets, and developing a user-friendly platform via Streamlit to enhance accessibility for researchers and drug developers.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。