Uncovering the potential virulence factors of emerging pathogens using AI/ML-based tools: a case study in Emergomyces africanus



Abstract

BACKGROUND: Artificial intelligence (AI) is now embedded in nearly all scientific disciplines. Applying it to predict the virulence factors of emerging pathogens can improve our understanding of their pathogenicity, especially because most of these pathogens' proteomes consist of hypothetical or uncharacterized proteins. Moreover, emerging orphan proteins are expressed from novel open reading frames. Therefore, this study aimed to develop a pipeline for predicting and annotating the structures of species-specific secreted proteins of these pathogens, with Emergomyces africanus selected as a model organism.
METHODS: The proteome of E. africanus CBS 136260 was retrieved from the NCBI database. The secretome of this fungus was predicted with the ML-based tools SignalP and Phobius, targeting signal peptide (SP)-bearing proteins. Species-specific proteins were identified using BLASTp (sequence level) and AFDB clusters (structure level). AlphaFold2, an AI-based system, was used to build structural models of hypothetical proteins specific to Emergomyces. DeepFRI was used to predict functional annotations of these proteins from their structures, while the DALI server was used to detect structurally similar (homologous) proteins. Candidate proteins were subjected to molecular docking analysis against MHC-II.
RESULTS: Structure modeling and homology matching revealed several protein domains similar to toxins (scorpion toxin-like, cytolysin, CARDS toxin, defensin-like), allergens, adhesins, hydrolytic enzymes, and inhibitors. Novel domains with putative functions (ion binding, proteolysis, transferase activity, and protein binding) were also discovered. In immunoinformatics and molecular docking studies, a protein containing a cytolysin-like domain (Gene ID: ACJ72_08076) outperformed the other selected proteins in binding to MHC-II (docking score = -318.74, confidence score = 0.96).
CONCLUSION: The findings suggest that AI and ML tools can be employed in the preliminary stage to explore host-pathogen interactions and anticipate novel virulence genes.
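
The candidate-selection steps described in METHODS can be pictured as a sequence of set operations on protein identifiers. The sketch below is illustrative only and is not the authors' code. It assumes that each tool's output has already been parsed into a collection of protein IDs, that a protein counts as secreted only when both SignalP and Phobius predict a signal peptide (the abstract does not say how the two predictions were combined), and that a protein is species-specific when it has no BLASTp hit and no AFDB cluster match outside the genus. All function names and the placeholder IDs are hypothetical.

```python
from typing import Iterable, List, Set


def consensus_secretome(signalp_ids: Iterable[str], phobius_ids: Iterable[str]) -> Set[str]:
    """Keep proteins that both predictors call signal-peptide positive.

    Taking the intersection is an assumption; a union would be the
    more permissive alternative.
    """
    return set(signalp_ids) & set(phobius_ids)


def without_homologs(candidates: Iterable[str], ids_with_hits: Iterable[str]) -> Set[str]:
    """Drop candidates that have a match outside the genus."""
    return set(candidates) - set(ids_with_hits)


def select_candidates(
    signalp_ids: Iterable[str],
    phobius_ids: Iterable[str],
    blast_hit_ids: Iterable[str],
    afdb_hit_ids: Iterable[str],
) -> List[str]:
    """Secreted proteins with no sequence-level or structure-level match."""
    secreted = consensus_secretome(signalp_ids, phobius_ids)
    no_sequence_match = without_homologs(secreted, blast_hit_ids)
    no_structure_match = without_homologs(no_sequence_match, afdb_hit_ids)
    return sorted(no_structure_match)


# Placeholder IDs, not real E. africanus gene IDs.
candidates = select_candidates(
    signalp_ids={"P1", "P2", "P3", "P4"},
    phobius_ids={"P2", "P3", "P4", "P5"},
    blast_hit_ids={"P2"},
    afdb_hit_ids={"P4"},
)
print(candidates)
```

In a pipeline of this shape, the surviving IDs would be the proteins passed on to structure modeling and functional annotation.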
