Assessment for antibiotic resistance in Helicobacter pylori: A practical and interpretable machine learning model based on genome-wide genetic variation

幽门螺杆菌抗生素耐药性评估:基于全基因组遗传变异的实用且可解释的机器学习模型

阅读:1

Abstract

Helicobacter pylori (H. pylori) antibiotic resistance poses a global health threat. Accurate identification of antibiotic resistant strains is essential for the control of infection. In the present study, our goal is to leverage the whole-genome data of H. pylori to develop practical and interpretable machine learning (ML) models for comprehensive antibiotic resistance assessment. A total of 296 H. pylori isolates with genome-wide data were downloaded from the Bacterial and Viral Bioinformatics Resource Center (BV-BRC) and the National Center for Biotechnology Information (NCBI) databases. By training ML models on feature sets of single nucleotide polymorphisms from SNP calling (SNPs-1), antibiotic-resistance SNP annotated by the Comprehensive Antibiotic Resistance Database (SNPs-2), gene presence or absence (GPA), we generated predictive models for four antibiotics and multidrug-resistance (MDR). Among them, the models that combined SNPs-1, SNPs-2, and GPA data demonstrated the best performance, with the eXtreme Gradient Boosting (XGBoost) consistently outperforming others. And then we utilized the SHapley Additive exPlanations (SHAP) method to interpret the ML models. Furthermore, a free web application for the MDR model was deployed to the GitHub repository (https://H.pylori/MDR/App/). Our study demonstrated the promise of employing whole-genome data in conjunction with ML algorithms to forecast H. pylori antibiotic resistance. In the future, the application of this approach for predicting H. pylori antibiotic resistance would hold the potential to mitigate the empiric administration.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。