Abstract
Understanding how pathogens evolve is fundamental to disease control and is a basic question in evolutionary biology, yet pathogens with complex life cycles violate assumptions of classic evolutionary models. Genetic analyses of the malaria parasite Plasmodium falciparum have shown multiple surprising patterns. For example, empirical analyses produce effective population size estimates that vary by orders of magnitude depending on the method and an excess of genes with elevated nonsynonymous variation (measured as πN/πS ). Here, reanalyzing genomic data from 18 worldwide populations, I show that these observations directly follow from distributions of genetic variation enriched for rare variants. Multiple potential mechanisms may increase the proportion of rare variants, including host expansions, lifecycle dynamics, population structure, or selection. The observed genealogies are more consistent with a multiple-merger coalescent than a Kingman coalescent, causing common summary statistics to be biased in predictable directions. In particular, the abundance of rare variants interacts with the mathematical properties of ratio statistics to systematically inflate gene-level πN/πS estimates even in the absence of selection. Notably, filtering this rare variation reveals previously masked candidates for selection, including well-characterized antigens such as merozoite surface proteins. This framework provides a foundation for interpreting genomic data in pathogens with high variance in reproductive success.