DMAPLM: A multimodal pretrained framework for computational drug repositioning

DMAPLM:一种用于计算药物重定位的多模态预训练框架

阅读:4

Abstract

Drug repositioning offers an efficient route to discover new therapeutic indications for existing drugs. However, current computational drug repositioning models often face challenges related to data scarcity, heterogeneity, and therefore limited generalizability. To address these limitations, this study introduces DMAPLM, a multimodal pretrained framework for predicting drug-disease associations for further drug repositioning screening. DMAPLM leverages a lightweight dual-encoder architecture, utilizing ChemBERTa-2 for molecular encoding of drug SMILES strings and BioBERT for semantic encoding of multi-field disease texts. The framework explicitly aligns drug and disease representations through contrastive learning and employs attention-weighted pooling to emphasize informative molecular substructures. A Random Forest classifier is finally used for association prediction based on the enhanced multimodal features. We compile a new and comprehensive benchmark dataset for performance evaluation. Extensive experiments demonstrate that DMAPLM significantly outperforms six state-of-the-art baseline models, achieving an AUROC of 0.8919 and AUPR of 0.9116 under five-fold cross-validation, representing an improvement of up to 9%. Furthermore, DMAPLM exhibits robust performance in challenging cold-start scenarios, highlighting its practical utility for identifying novel drug-disease relationships. Case studies along with molecular docking analysis confirm the interpretability and biological meaningfulness of our predictions. Our study provides a powerful and interpretable approach for computational drug repositioning.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。