Hybrid analysis with phylogeny and population modeling to estimate the recent founding date of a population: A case study in the origins of COVID-19 illustrates how a branching process approximation can simplify a hybrid analysis

结合系统发育和种群建模的混合分析来估计种群的近期起源日期:以新冠病毒起源为例,说明分支过程近似如何简化混合分析

阅读:1

Abstract

The exact date of the primary infection in COVID-19 remains unknown. One influential article (Pekar et al. (2021)) estimated the date with a hybrid analysis combining epidemiological and phylogenetic methods. The phylogenetic methods analyzed 583 SARS-COV-2 complete genomes to estimate the sample tMRCA (time of the most recent common ancestor). Before igniting as an epidemic, however, COVID-19 may have had several population bottlenecks with only a single infected person, so the MRCA merely represents the last such bottleneck. Pekar et al. (2021) therefore used epidemiological methods to estimate the time from the primary infection to the sample MRCA. The hybrid method involved several arbitrary decisions, however, reflecting the fact that the epidemiological and phylogenetic analyses overlap at the sample MRCA and are generally probabilistically dependent. Towards removing the dependence, note that the start of an epidemic has a branching process approximation. Let the branching process have a single ancestor. If the branching process does not go extinct, define skeleton particles (individuals) to be particles whose lineages do not go extinct, and define the long-time MRCA as the earliest skeleton particle with at least two skeleton offspring. A linear phylogeny of skeleton particles therefore separates the ancestor from the long-time MRCA. Probabilistically, the linear phylogeny is a defective renewal process of skeleton particles, making the generation count geometrically distributed. Moreover, the terminology "long-time MRCA" is apt, because as time becomes arbitrarily large, the MRCA of the corresponding extant population approaches the long-time MRCA. Effectively, the focus on the long-time MRCA makes the forward epidemiological and backward phylogenetic analyses probabilistically independent. The present article can therefore confirm most of the epidemiological conclusions of the hybrid analysis of Pekar et al. (2021). Its use of branching process approximations also points the way to noticeable simplifications in the hybrid method.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。