'apparent': a simple and flexible R package for accurate SNP-based parentage analysis in the absence of guiding information

“apparent”:一个简单灵活的R包,用于在缺乏指导信息的情况下进行准确的基于SNP的亲子鉴定分析。

阅读:1

Abstract

BACKGROUND: The accurate determination of parent-progeny relationships within both in situ natural populations and ex situ genetic resource collections can greatly enhance plant breeding/domestication efforts and support plant genetic resource conservation strategies. Although a range of parentage analysis tools are available, none are designed to infer such relationships using genome-wide single nucleotide polymorphism (SNP) data in the complete absence of guiding information, such as generational groups, partial pedigrees, or genders. The R package ('apparent') developed and presented here addresses this gap. RESULTS: 'apparent' adopts a novel strategy of parentage analysis based on a test of genetic identity between a theoretically expected progeny (EP(ij)), whose genotypic state can be inferred at all homozygous loci for a pair of putative parents (i and j), and all potential offspring (PO(k)), represented by the k individuals of a given germplasm collection. Using the Gower Dissimilarity metric (GD), genetic identity between EP(ij) and PO(k) is taken as evidence that individuals i and j are the true parents of offspring k. Significance of a given triad (parental pair(ij) + offspring(k)) is evaluated relative to the distribution of all GD(ij|k) values for the population. With no guiding information provided, 'apparent' correctly identified the parental pairs of 15 lines of known pedigree within a test population of 77 accessions of Actinidia arguta, a performance unmatched by five other commonly used parentage analysis tools. In the case of an inconclusive triad analysis due to the absence of one parent from the test population, 'apparent' can perform a subsequent dyad analysis to identify a likely single parent for a given offspring. Average dyad analysis accuracy was 73.3% in the complete absence of pedigree information but increased to 100% when minimal generational information (adults vs. progeny) was provided. CONCLUSIONS: The 'apparent' R package is a fast and accurate parentage analysis tool that uses genome-wide SNP data to identify parent-progeny relationships within populations for which no a priori knowledge of family structure exists.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。