Improvements in genome sequencing and assembly are enabling high-quality reference genomes for all species. However, the assembly process is still laborious, computationally and technically demanding, lacks standards for reproducibility, and is not readily scalable. Here we present the latest Vertebrate Genomes Project assembly pipeline and demonstrate that it delivers high-quality reference genomes at scale across a set of vertebrate species arising over the last ~500 million years. The pipeline is versatile and combines PacBio HiFi long-reads and Hi-C-based haplotype phasing in a new graph-based paradigm. Standardized quality control is performed automatically to troubleshoot assembly issues and assess biological complexities. We make the pipeline freely accessible through Galaxy, accommodating researchers even without local computational resources and enhanced reproducibility by democratizing the training and assembly process. We demonstrate the flexibility and reliability of the pipeline by assembling reference genomes for 51 vertebrate species from major taxonomic groups (fish, amphibians, reptiles, birds, and mammals).
Scalable, accessible, and reproducible reference genome assembly and evaluation in Galaxy.
阅读:8
作者:Larivière Delphine, Abueg Linelle, Brajuka Nadolina, Gallardo-Alba Cristóbal, Grüning Bjorn, Ko Byung June, Ostrovsky Alex, Palmada-Flores Marc, Pickett Brandon D, Rabbani Keon, Balacco Jennifer R, Chaisson Mark, Cheng Haoyu, Collins Joanna, Denisova Alexandra, Fedrigo Olivier, Gallo Guido Roberto, Giani Alice Maria, Gooder Grenville MacDonald, Jain Nivesh, Johnson Cassidy, Kim Heebal, Lee Chul, Marques-Bonet Tomas, O'Toole Brian, Rhie Arang, Secomandi Simona, Sozzoni Marcella, Tilley Tatiana, Uliano-Silva Marcela, van den Beek Marius, Waterhouse Robert M, Phillippy Adam M, Jarvis Erich D, Schatz Michael C, Nekrutenko Anton, Formenti Giulio
| 期刊: | bioRxiv | 影响因子: | 0.000 |
| 时间: | 2023 | 起止号: | 2023 Jun 30 |
| doi: | 10.1101/2023.06.28.546576 | ||
特别声明
1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。
2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。
3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。
4、投稿及合作请联系:info@biocloudy.com。
