Abstract
The ability of organisms to adapt and survive depends on the effects of genes and the environment on fitness. However, the multigenic nature of fitness traits and genotype-by-environment interactions hinder our ability to understand the genetic basis of fitness. Here, we established fitness prediction models for 35 environments using machine learning and existing fitness data and different types of genetic variants for a population of Saccharomyces cerevisiae isolates. Models revealed that the predictive ability of genetic variants varied across environments, with copy number variants explaining the majority of fitness variation in most cases. Model interpretation further showed that different variant types identified distinct sets of genes associated with predictive variants. These gene sets were significantly enriched in experimentally validated genes affecting fitness in only a subset of environments, indicating that many genes influencing fitness remain unexplored. Notably, non-experimentally validated genes were more important than validated ones for fitness predictions. Gene contributions to fitness predictions were both isolate and environment dependent, pointing to gene-by-gene and gene-by-environment interactions. Further interpretation of models uncovered experimentally validated and novel candidate genetic interactions for a well characterized stress, the fungicide benomyl. These findings highlight the feasibility of identifying the genetic basis of fitness by using different types of genetic variants and offer novel targets for future functional analysis.