PerseuCPP: a machine learning strategy to predict cell-penetrating peptides and their uptake efficiency

PerseuCPP:一种利用机器学习预测细胞穿透肽及其摄取效率的策略

阅读:1

Abstract

MOTIVATION: Cell-penetrating peptides (CPPs) are promising tools for transporting therapeutic molecules into cells without damaging the cellular membrane. These peptides serve as efficient drug delivery systems, capable of carrying diverse biologically active substances while exhibiting low cytotoxicity compared to non-native molecules. However, identifying CPPs through experimental methods is expensive and time-consuming, making computational strategies an attractive alternative due to their cost-effectiveness and scalability. RESULTS: This study introduces PerseuCPP, a machine learning strategy designed to identify CPPs. Based on descriptors including physicochemical and structural properties as well as atomic composition, our strategy employs the Extremely Randomized Trees to predict CPPs and their uptake efficiency. The first stage was developed using a balanced dataset of 967 CPPs and non-CPPs, applying a 10-fold cross-validation scheme. Two independent datasets were utilized for validation. The CPP predictor achieved superior results compared to state-of-the-art methods, with MCC 0.854, Recall 0.860, and AUC 0.984. The second stage, focused on efficiency prediction, was trained on a balanced dataset of 140 CPPs and non-CPPs, also using a 10-fold cross-validation scheme, and validated with an independent dataset. The efficiency predictor achieved competitive results, with Recall 0.761 and AUC 0.690. PerseuCPP is interpretable, offering insights into the key descriptors enabling peptides to penetrate cells effectively. We anticipate that PerseuCPP will be a valuable tool for advancing the design and application of CPPs in drug delivery and biomedical research. AVAILABILITY AND IMPLEMENTATION: https://github.com/goalmeida05/PERSEU.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。