Linkage-aware inference of fitness from short-read time-series genomic data

基于短读长时间序列基因组数据的连锁感知适应度推断

阅读:2

Abstract

Inferring the fitness effect of mutations is a basic problem in understanding the evolution of populations over time. When multiple mutations are present in a population simultaneously, genetic linkage comes into play, and the fate of an individual mutation depends on both its fitness as well as the background on which it occurs. Accurate inference of fitness effects for evolutionary systems with multiple competing mutations is therefore contingent on resolving the confounding effects of genetic linkage, captured by the covariance between allele-pairs. Increasingly, evolutionary studies are using short-read sequencing technologies to produce detailed snapshots of evolving populations. This presents a problem as the frequencies of allele-pairs are not known beyond the read-length, hampering any attempt to resolve the effects of genetic linkage between pairs of loci residing on different reads. Here we present a computationally efficient pipeline for inferring selection from short-read time-series data with partial allele-pair frequency information, while accounting for linkage. Simulation results show that the method has good performance and is scalable to systems with several thousand variants. Additionally, we demonstrate the pipeline's utility on real datasets of within-host HIV and SARS-CoV-2 evolution, showcasing its applicability in resolving linkage effects from complex evolutionary histories.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。