Improving Nanopore sequencing-based core genome MLST for global infection control: a strategy for GC-rich pathogens like Burkholderia pseudomallei

改进基于纳米孔测序的核心基因组多位点序列分型以控制全球感染:针对富含GC含量病原体(如类鼻疽伯克霍尔德菌)的策略

阅读:1

Abstract

Genomic surveillance of pathogens is essential to trace infections and analyze resistance markers. Core genome multilocus sequence typing (cgMLST) facilitates genomic surveillance by simplified analysis and standardization. However, its application is limited by the poor cost-efficiency of short-read (SR) sequencing. Oxford Nanopore long-read sequencing (ONT-LR), which allows fast on-site analysis with comparatively low costs, could provide an alternative. Despite ONT-LR raw read accuracy improvement, evidence for methylation-based errors accumulates. PCR-based library preparation, suggested as a solution, presumably poses difficulties for GC-rich bacteria. We challenged ONT-LR-based cgMLST using the highly GC-rich pathogen Burkholderia pseudomallei to develop a clinically applicable workflow. Our B. pseudomallei cgMLST scheme was applied to ONT-LR data, and the results were validated against SR data. Native, rapid, and PCR-based library preparation was performed and combined with different basecalling models (SUP@bacterial-methylation, SUP@v4.2, SUP@v4.3, and SUP@v5.0) and polishing strategies (medaka_consensus, medaka_variant, r103_min_high_g360). To ensure reliability across genotypes, we included 14 sequence types and 27 genotypes. The recommended ONT-LR workflow at study initiation (SUP@v4.2, medaka_consensus) showed nearly 200 allele differences compared with the reference for specific strains. PCR-based library preparation resulted in missing targets and typing errors of up to 21 alleles. Native barcoding with SUP@v5.0 basecalling and r103_min_high_g360 polishing outperformed the PCR-based approach in all parameters reducing the error rate to a maximum of two allele differences. The optimized ONT-LR-based cgMLST workflow for B. pseudomallei integrates high resolution and ease of implementation with enhanced cost-efficiency for rapid diagnostics. The developed protocol might serve as a guideline for other GC-rich pathogens. IMPORTANCE: This study highlights a significant advancement in genomic surveillance of bacterial pathogens, specifically addressing the challenges posed by the GC-rich species Burkholderia pseudomallei. Core genome multilocus sequence typing (cgMLST) is widely used for bacterial typing as it combines high resolution with simple implementation and standardization. To improve cost efficiency and thus accessibility, we changed the sequencing approach from Illumina short-read (SR) to Oxford Nanopore long-read sequencing (ONT-LR). ONT-LR-based cgMLST showed a very high error rate compared with SR-based cgMLST, most likely due to methylation-associated errors. PCR-based library preparation, which is proposed to correct these errors, did not achieve the required accuracy. In contrast, native barcoding with advanced basecalling and polishing strategies massively reduces allelic differences. This optimized ONT-LR cgMLST workflow provides a transformative solution for cost-efficient, high-resolution typing of B. pseudomallei. Furthermore, this study can serve as a guide for similarly challenging bacteria.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。