SAVANA: reliable analysis of somatic structural variants and copy number aberrations using long-read sequencing.

阅读:8
作者:Elrick Hillary, Sauer Carolin M, Espejo Valle-Inclan Jose, Trevers Katherine, Tanguy Melanie, Zumalave Sonia, De Noon Solange, Muyas Francesc, Cascão Rita, Afonso Angela, Rust Alistair G, Amary Fernanda, Tirabosco Roberto, Giess Adam, Freeman Timothy, Sosinsky Alona, Piculell Katherine, Miller David T, Faria Claudia C, Elgar Greg, Flanagan Adrienne M, Cortes-Ciriano Isidro
Accurate detection of somatic structural variants (SVs) and somatic copy number aberrations (SCNAs) is critical to study the mutational processes underpinning cancer evolution. Here we describe SAVANA, an algorithm designed to detect somatic SVs and SCNAs at single-haplotype resolution and estimate tumor purity and ploidy using long-read sequencing data with or without a germline control sample. We also establish best practices for benchmarking SV detection algorithms across the entire genome in a data-driven manner using replication and read-backed phasing analysis. Through the analysis of matched Illumina and nanopore whole-genome sequencing data for 99 human tumor-normal pairs, we show that SAVANA has significantly higher sensitivity and 13- and 82-times-higher specificity than the second and third-best performing algorithms. Moreover, SVs reported by SAVANA are highly consistent with those detected using short-read sequencing. In summary, SAVANA enables the application of long-read sequencing to detect SVs and SCNAs reliably.

特别声明

1、本文转载旨在传播信息,不代表本网站观点,亦不对其内容的真实性承担责任。

2、其他媒体、网站或个人若从本网站转载使用,必须保留本网站注明的“来源”,并自行承担包括版权在内的相关法律责任。

3、如作者不希望本文被转载,或需洽谈转载稿费等事宜,请及时与本网站联系。

4、此外,如需投稿,也可通过邮箱info@biocloudy.com与我们取得联系。