Retention order prediction of peptides containing non-proteinogenic amino acids

含非蛋白源氨基酸的肽的保留顺序预测

阅读:2

Abstract

MOTIVATION: Peptides containing non-proteinogenic amino acids (PNPAs) are promising targets in drug development for their unique pharmacological properties. The lack of their mass spectra or retention data has been hindering PNPA research, where accurate assessment of their retention time in chromatography is crucial for identifying structures and characterizing functions. Conventional methods are often ineffective due to limited amount of data. This study aims to predict their retention order, not absolute time, from structures by using data from peptides and small molecules. This approach can advance natural product identification and drug research. RESULTS: Our model uses the Ranking Support Vector Machine, and successfully predicted the retention order of PNPA with an accuracy of over 0.9. Counting fingerprints and MIX fingerprint, which combines four types of fingerprints, were used as explanatory variables. To suppress the multi-collinearity, principal component analysis was applied to reduce spurious fingerprints. SHAP value analysis revealed that one component, derived from methyl groups, contributed most for the prediction. Overall, order prediction can effectively find candidate compounds from LC/MS data from non-conventional biological extracts. AVAILABILITY AND IMPLEMENTATION: https://github.com/ShoheiNakamukai/RO_prediction_of_PNPA/tree/main.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。