Abstract
BACKGROUND: Taxus wallichiana is an important species for paclitaxel production. Previous genome versions for Taxus spp. have been limited by extensive gaps, hindering the complete annotation and mining of paclitaxel (known as Taxol commercially) synthesis pathway-related genes. RESULTS: Here, we present the first phased high-quality reference genome of T. wallichiana, which significantly improves assembly quality and corrects large-scale assembly errors present in previous versions. The 2 haplotypes are 9.87 Gb and 9.98 Gb in length, respectively, and all 24 chromosomes were assembled with telomeres at both ends. Based on this high-quality genome (TWv1), we inferred that the candidate sex chromosome of T. wallichiana is chr12, and its sex determination system may follow a ZW model. Particularly, we identified and experimentally validated a batch of 2-oxoglutarate/Fe(II)-dependent dioxygenases (ODDs), which may be key C4β-C20 epoxidases in the paclitaxel synthesis pathway. CONCLUSIONS: This study not only provides a valuable data resource for gene mining in the biosynthetic pathways of secondary metabolites, such as paclitaxel, but also offers the highest-quality reference genome of gymnosperms to date for the identification of sex chromosomes, facilitating comparative genomic studies among gymnosperms.