Analyzing long-read CRISPR experiments with CRISPRLungo

使用 CRISPRLungo 分析长读长 CRISPR 实验

阅读:1

Abstract

Long-read sequencing can characterize complex genome editing-induced DNA sequence changes such as large deletions, insertions, and inversions that are difficult to detect using short-read sequencing. However, PCR amplification and sequencing errors complicate accurate variant detection, and existing analysis tools are not optimized for gene editing specific allelic outcomes. Here we present CRISPRLungo, a computational pipeline specifically designed for long-read amplicon sequencing of gene edited samples. CRISPRLungo incorporates unique molecular identifier (UMI)-based error correction and statistical filtering to distinguish true editing events from background noise, enabling robust detection of small indels and structural variants. Through systematic benchmarking using simulated datasets, we demonstrate that CRISPRLungo outperforms existing approaches in both accuracy and read recovery. CRISPRLungo supports both Oxford Nanopore and PacBio platforms and identify previously undetected structural variant edits such as inversions in published CRISPR datasets. To demonstrate allele-specific edit quantification, we applied CRISPRLungo to analyze edited primary cells from a patient with harboring compound heterozygous SBDS mutations, accurately quantifying SBDS editing outcomes despite contaminating reads from the homologous SBDSP1 pseudogene. To maximize accessibility, we developed a fully client-side web application requiring no installation, making advanced long-read analysis accessible to researchers regardless of computational expertise. CRISPRLungo is freely available at https://github.com/pinellolab/CRISPRLungo with a user-friendly web interface available at https://pinellolab.github.io/CRISPRLungo.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。