Abstract
Diabetes mellitus (DM) is a global health concern, and dipeptidyl peptidase-4 (DPP-4) is a key therapeutic target. The study used three machine learning and deep learning models to predict potential DPP-4 inhibitors using a curated data set of 6,750 compounds. The models included support vector machine (SVM), random forest (RF), naive Bayes (NB), and multitask deep neural network (MTDNN). The MTDNN model demonstrated strong predictive performance, achieving 98.62% train accuracy and 98.42% test accuracy for predicting DPP-4 inhibitors and a correlation coefficient of 0.979 for training and 0.977 for the test data set, with low training and test errors while predicting corresponding IC(50) values. The MTDNN model predicted potential inhibitors using an external data set of FDA-approved drugs, identifying 100 compounds. Among these, five compounds stood out with promising molecular docking and dynamic profiles, suggesting their potential as repurposed drugs for targeting DPP-4 and offering hope for the future of diabetes treatment.