Abstract
Hookworm infection is a neglected tropical disease affecting hundreds of thousands of people annually in the tropics and sub-tropics. Population genomic approaches have the potential to improve our understanding of hookworm infection dynamics and control efficacy. Here, we validate an approach to generate genome-wide polymorphism datasets from accessible sample types in the zoonotic hookworm (Ancylostoma ceylanicum) then apply the validated approach in the human hookworm (Necator americanus) to compare laboratory- and field-derived samples. We first present an optimized method for purifying nucleic acid from individual third-stage hookworm larvae (L3s). We then measure the accuracy of variant call datasets generated through whole genome amplification (WGA) and next-generation sequencing (NGS). We demonstrate that WGA via multiple displacement amplification (MDA) introduces predictable biases that are exacerbated by low inputs and poor sample preservation but show that with sufficient input mass (≥0.1ng) we are still able to produce highly accurate variant call datasets from nucleic acid concentrations that reflect those of individual L3s. Using our validated approach, we infer laboratory- and field-collected samples of N. americanus as distinct populations, with higher levels of heterozygosity and nucleotide diversity identified in field-collected samples, suggesting signatures of inbreeding and/or drift are detectable in laboratory specimens within several years of initiation of infection. We also show that, despite expected reductions in heterozygosity, laboratory samples still possess numerous heterozygous sites, and we demonstrate that a reference genome generated from an adult worm from an early laboratory passage performs well for variant calling in both laboratory- and field-derived samples. Moving forward, our optimized method for nucleic acid purification can be broadly applied to generate input for any amplification-based approach where sequencing individual hookworm L3s, rather than a pool of specimens, is preferred. Our validated population genomics workflow can be used to characterize structure and connectivity of hookworm populations in endemic communities, with the goal of leveraging these insights to improve our approach to hookworm treatment and control.