Machine learning method for the prediction of Bedaquiline-resistant Mycobacterium tuberculosis

用于预测贝达喹啉耐药结核分枝杆菌的机器学习方法

阅读:2

Abstract

The study addresses the increasing resistance to the FDA-approved drug Bedaquiline (BDQ) in Mycobacterium tuberculosis (MTB). The absence of any defined resistance locus and the wide variation in the drug targets across clinical isolates have raised a big question about our understanding of the molecular basis of BDQ resistance acquisition. Using machine learning (ML) methods, BDQ resistance was predicted from whole-genome sequencing data for MTB clinical isolates. Variant calling format data generation involved several steps, including adapter trimming and alignment to the H37Rv reference genome. The ML models, namely, Multilayer Perceptron and Random Forest (RF), achieved high accuracies of 83.60% and 79.64%, respectively. The top 50 features were mapped to the H37Rv reference genome, and several new drug targets were identified. In addition to the coding regions, some non-coding intergenic regions were also obtained. Mapping of these features to the H37Rv genome revealed 15 new antibiotic-resistant genes. In addition, the use of explainable AI (XAI) methods, such as SHapley Additive exPlanations, facilitated the identification of mutations associated with BDQ resistance. In conclusion, the ML models demonstrated effective predictive capabilities for BDQ resistance, whereas XAI contributed to understanding key resistance features.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。