Discussion
We provide new RNA-seq data sets for 24 species of vascular plants in Harvard Forest. Challenges associated with this type of study included recovery of high-quality RNA from diverse species and access to NEON sites for genomic sampling. Overcoming these challenges offers opportunities for large-scale studies at the intersection of ecology and genomics.
Methods
We generated >650 Gbp of RNA-seq for 24 vascular plant species representing 12 genera and nine families at the Harvard Forest NEON site. Each species was sampled twice in 2016 (July and August). We assessed transcriptome quality and content with TransRate, BUSCO, and Gene Ontology annotations.
Results
Only modest differences in assembly quality were observed across multiple k-mers. On average, transcriptomes contained hits to >70% of loci in the BUSCO database. We found no significant difference in the number of assembled and annotated transcripts between diploid and polyploid transcriptomes.
