Abstract
Haplotyped-resolved phased assemblies aim to capture the full allelic diversity in heterozygous and polyploid species to enable accurate genetic analyses. However, building non-collapsed references still presents a challenge. Here, we used long-range interaction Hi-C reads (high-throughput chromatin conformation capture) and HiFi PacBio reads to assemble the genome of the apomictic cultivar Basilisks from Urochloa decumbens (2n = 4x = 36), an outcrossed tetraploid Paniceae grass widely cropped to feed livestock in the tropics. We identified and removed Hi-C reads between homologous unitigs to facilitate their scaffolding and employed methods for the manual curation of rearrangements and misassemblies. Our final phased assembly included the 4 haplotypes in 36 chromosomes. We found that 18 chromosomes originated from diploid Urochloa brizantha and the other 18 from either Urochloa ruziziensis or diploid U. decumbens. We also identified a chromosomal translocation between chromosomes 5 and 32, as well as evidence of pairing exclusively within subgenomes, except for a homoeologous exchange in chromosome 21. Our results demonstrate that haplotype-aware assemblies accurately capture the allelic diversity in heterozygous species, making them the preferred option over collapsed-haplotype assemblies.