Detection of PCR chimeras in adaptive immune receptor repertoire sequencing using hidden Markov models

Abstract

Adaptive Immune Receptor Repertoire sequencing (AIRR-seq) has emerged as a central approach for studying T cell and B cell receptor populations, and is now an important component of antibody discovery and of studies of autoimmunity and of immune responses to pathogens, vaccines, allergens, and cancers. When amplifying the rearranged V(D)J genes encoding antigen receptors, each cycle of the Polymerase Chain Reaction (PCR) can produce spurious "chimeric" hybrids of two or more different template sequences. While the generation of chimeras is well understood in bacterial and viral sequencing, and there are dedicated tools to detect such sequences in bacterial and viral datasets, this is not the case for AIRR-seq. Further, the process that results in immune receptor sequences has domain-specific challenges, such as somatic hypermutation (SHM), and domain-specific opportunities, such as relatively well-known germline gene "reference" sequences. Here we describe CHMMAIRRa, a hidden Markov model for detecting chimeric sequences in AIRR-seq data that specifically models SHM and incorporates germline reference sequences. We use simulations to characterize the performance of CHMMAIRRa and compare it to existing methods from other domains; we test the effect of PCR conditions on chimerism using IgM libraries generated in this study; and we apply CHMMAIRRa to four published AIRR-seq datasets to show the extent and impact of artifactual chimerism.
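
To make the general idea concrete, the following is a minimal toy sketch of how a hidden Markov model can flag a chimeric read. It is not CHMMAIRRa's actual model or code. It assumes each hidden state is one germline reference sequence already aligned to the query at equal length, that mismatches (standing in for SHM or sequencing error) occur at a single fixed rate, and that a switch between references is a rare event. The function names `viterbi_chimera` and `is_chimeric` and the parameters `mut_rate` and `switch_prob` are hypothetical, chosen only for illustration.

```python
import math


def viterbi_chimera(query, refs, mut_rate=0.05, switch_prob=1e-3):
    """Return the most likely reference index for each query position.

    Toy assumptions: refs are pre-aligned to query (same length, no gaps),
    one fixed mismatch rate, uniform switching between references.
    """
    n, length = len(refs), len(query)
    if n < 2 or any(len(r) != length for r in refs):
        raise ValueError("need >= 2 references, all as long as the query")

    log_match = math.log(1 - mut_rate)
    log_mismatch = math.log(mut_rate / 3)         # three alternative bases
    log_stay = math.log(1 - switch_prob)
    log_switch = math.log(switch_prob / (n - 1))  # spread over other refs

    def emit(k, t):
        return log_match if refs[k][t] == query[t] else log_mismatch

    # Initialisation: uniform prior over references.
    score = [math.log(1 / n) + emit(k, 0) for k in range(n)]
    back = []

    # Recursion: best predecessor for each state at each position.
    for t in range(1, length):
        new_score, ptr = [], []
        for k in range(n):
            cands = [score[j] + (log_stay if j == k else log_switch)
                     for j in range(n)]
            j_best = max(range(n), key=cands.__getitem__)
            new_score.append(cands[j_best] + emit(k, t))
            ptr.append(j_best)
        score = new_score
        back.append(ptr)

    # Traceback from the best final state.
    state = max(range(n), key=score.__getitem__)
    path = [state]
    for ptr in reversed(back):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return path


def is_chimeric(path):
    """A read is called chimeric if its best path uses more than one reference."""
    return len(set(path)) > 1


refs = ["A" * 20, "C" * 20]
chimera_path = viterbi_chimera("A" * 10 + "C" * 10, refs)
mutated_path = viterbi_chimera("AAAAACAAAAAAAAAAAAAA", refs)
print(is_chimeric(chimera_path), is_chimeric(mutated_path))
```

In this sketch, a read whose first half matches one reference and whose second half matches another is explained more cheaply by a single switch than by ten mismatches, so the decoded path changes state. A read with a single point mismatch is explained more cheaply by one mutation than by two switches, so it stays on one reference. That trade-off between mutation and recombination cost is the reason a mutation-aware model is attractive for hypermutated receptor sequences.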
