Functional (re)annotation of Mycobacteroides abscessus proteome using integrative sequence and AI-based structural approaches

利用整合序列和基于人工智能的结构方法对脓肿分枝杆菌蛋白质组进行功能(重新)注释

阅读:1

Abstract

Functional annotation of proteins is crucial in understanding the basic biology of organisms. In the context of pathogens, it can provide valuable insights towards its functional landscape contributing to understanding the molecular mechanisms of pathogenesis and survival. In this study, we explored the applications of sequence and AI-driven structure-based methods to functionally (re)annotate Mycobacteroides abscessus (MAB). MAB is an opportunistic pathogen responsible for causing infections in immunocompromised patients and exhibits resistance to several antibiotics. The global rise in drug-resistant strains and the recently identified potential for indirect human-to-human transmission emphasizes the importance of understanding MAB as a critical pathogen. However, there is a huge gap in our understanding of the MAB proteome, which is vital not only for understanding the functional aspects of various proteins but also for prioritizing drug targets for therapeutic development. Presently, 28 % of the MAB proteome, as available in UniProtKB, is poorly annotated, and more than a fourth of MAB proteome lack gene ontology (GO) terms, indicating a lack of standard functional descriptions. To this end, the present study aims to functionally (re)annotate MAB proteome using a combination of sequence and structure-based approaches in a systematic way. We performed sequence-based similarity search against NR database and performed HMM based search for functional domains with Pfam and CATH. Then, we utilized MAB AlphaFold-predicted structures to annotate MAB proteins with structure-based similarity search using Foldseek to identify proteins and transfer their gene ontology (GO) annotations. We assigned new GO annotations (374 proteins) and refined the existing annotations (885 proteins) for previously unannotated essential genes of MAB. In addition, we also performed annotations using an integrated sequence and structure-based approach for the 29 proteins for which AlphaFold structures were not available. In the end, structural comparisons of a few proteins that were similar to Mycobacterium tuberculosis were explored, revealing residue-level differences in MAB linked to drug resistance. Our study highlights a combined sequence- and AI-driven structure-based approach for large-scale proteome functional annotation, which can be applied to any organism of interest.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。