Finding disease genes: a fast and flexible approach for analyzing high-throughput data

寻找致病基因:一种快速灵活的高通量数据分析方法

阅读:1

Abstract

Linkage disequilibrium (LD) is the non-random distribution of alleles across the genome, and it can create serious problems for modern linkage studies. In particular, computational feasibility is often obtained at the expense of power, precision, and/or accuracy. In our new approach, we combine linkage results over multiple marker subsets to provide fast, efficient, and robust analyses, without compromising power, precision, or accuracy. Allele frequencies and LD in the densely spaced markers are used to construct subsamples that are highly informative for linkage. We have tested our approach extensively, and implemented it in the software package EAGLET (Efficient Analysis of Genetic Linkage: Estimation and Testing). Relative to several commonly used methods we show that EAGLET has increased power to detect disease genes across a range of trait models, LD patterns, and family structures using both simulated and real data. In particular, when the underlying LD pattern is derived from real data, we find that EAGLET outperforms several commonly used linkage methods. In-depth analysis of family data, simulated with linkage and under the real-data derived LD pattern, showed that EAGLET had 78.1% power to detect a dominant disease with incomplete penetrance, whereas the method that uses one marker per cM had 69.7% power, and the cluster-based approach implemented in MERLIN had 76.7% power. In this same setting, EAGLET was three times faster than MERLIN, and it narrowed the MERLIN-based confidence interval for trait location by 29%. Overall, EAGLET gives researchers a fast, accurate, and powerful new tool for analyzing high-throughput linkage data, and large extended families are easily accommodated.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。