CaMutQC: An R package for integrative quality control and filtration of cancer somatic mutations

CaMutQC:用于癌症体细胞突变综合质量控制和过滤的 R 软件包

阅读:1

Abstract

The quality control and filtration of cancer somatic mutations (CAMs), including the elimination of false positives due to technical bias and the selection of key mutation candidates, are crucial steps for downstream analysis in cancer genomics. However, due to diverse needs and the lack of standardized filtering criteria, the filtering strategies applied vary from study to study, often resulting in reduced efficiency, accuracy, and reproducibility. Here, we present CaMutQC, a heuristic quality control and soft-filtering R/Bioconductor package designed specifically for CAMs. CaMutQC enables users to remove false positive mutations, select potential mutation candidates, and estimate Tumor Mutation Burden (TMB) with a single line of code, using either default or customized parameters. A filter report and a code log can also be generated after the filtration process to facilitate reproducibility and comparison. The application of CaMutQC to a Whole-exome Sequencing (WES) benchmark dataset demonstrated its strong capability by eliminating 85.55 % of false positive Single nucleotide variants (SNVs) while retaining 90.72 % of true positive SNVs. Additionally, an additional 11.56 % of true positive SNVs were rescued through CaMutQC's built-in union strategy. Similar results were observed for Insertions and Deletions (INDELs). CaMutQC is freely available through Bioconductor at https://bioconductor.org/packages/CaMutQC/ under the GPL v3 license.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。