MiST: rapid, accurate and flexible (core-genome) multi-locus sequence typing (MLST) allele calling from draft genomes

MiST:基于基因组草图的快速、准确、灵活的(核心基因组)多位点序列分型(MLST)等位基因分型

阅读:1

Abstract

BACKGROUND: Multi-locus sequence typing (MLST) is a typing method to differentiate bacteria based on the sequence of several housekeeping genes. Identifiers are assigned to unique allele sequences for each locus in the scheme, and the combination of these identifiers defines a genetic profile. Its reproducibility and portability across laboratories have made it a staple typing method, widely used for epidemiological and evolutionary analyses. Although MLST was traditionally based on Sanger sequencing, the method is still widely used in the era of whole-genome sequencing (WGS). Moreover, WGS has made core-genome MLST (cgMLST) possible, which scale up MLST to hundreds or thousands of loci across the genome. Conventional and cgMLST schemes are publicly available on various platforms such as PubMLST.org, BIGSdb Institut Pasteur, cgMLST.org, and EnteroBase. However, the available software for (offline) cgMLST allele calling is often not flexible to accommodate schemes from diverse sources and/or lacks the computational scalability required to efficiently process larger schemes. RESULTS: In this study, we present Minimap2-inferred Sequence Typing (MiST), a rapid and flexible cgMLST allele caller that is low-resource intensive and can easily accommodate schemes from different sources. We benchmarked the tool against other available MLST and cgMLST calling software using WGS data from five different species and schemes collected from various sources. We demonstrate that MiST can accurately identify alleles while requiring substantially fewer computational resources than existing allele callers. CONCLUSION: MiST can help to make cgMLST analysis more accessible for integration into local bioinformatics workflows. MiST is available as an open-source Python package under the GPLv3 license at https://github.com/BioinformaticsPlatformWIV-ISP/MiST , and can be installed via Pip or Conda.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。