Comparison of two-stage methods for count data in Mendelian randomization: a simulation study

孟德尔随机化中计数数据两阶段方法的比较:一项模拟研究

阅读:2

Abstract

Aim: Mendelian randomization (MR) is an instrumental variable (IV) method that utilizes genetic variants to establish causality between risk factors and outcomes in observational studies. These methods were primarily developed under assumptions appropriate for continuous or approximately normally distributed variables. However, in many biomedical and clinical studies, exposures and outcomes are naturally recorded as counts, such as the number of disease episodes or clinical events. Despite this, two-stage MR methods are applied to count data without a clear understanding of their validity under such settings. While individual-level MR methods like two-stage predictor substitution (TSPS) and two-stage residual inclusion (TSRI) are common, their comparative performance for count exposures and outcomes remains unclear. Materials & methods: We conducted the first systematic evaluation of TSPS and TSRI for count data using Poisson and negative binomial models across realistic MR scenarios. Simulations varied instrument strength (IS), confounding, sample size and also focused on invalid instruments. Performance was assessed by bias, root mean square error (RMSE), 95% confidence interval (CI) coverage, CI width and Type I error rate. To demonstrate practical application, we applied these methods to investigate the causal relationship between alcohol consumption and gout attacks using empirical data. Results: Our results revealed that across all scenarios, TSRI with the Poisson model produced the most stable estimates with lower bias and RMSE. TSRI achieved near-nominal coverage and narrower CI across varying IS and confounding levels, maintaining Type I error close to 0.05. IS significantly impacted performance, with IS = 0.5 yielding estimates closer to the true values, while weaker instruments (IS = 0.1) led to higher RMSE and bias. Increasing sample size in the presence of invalid and weak genetic variants increased the bias. In additional simulations with multiple weak instruments, TSRI continued to outperform TSPS, yielding lower bias or RMSE, narrow CI width and near-nominal coverage across sample sizes. In our application, alcohol consumption was causally associated with an estimated 11.6-12.7% increase in the expected number of gout attacks per year per unit increase in alcohol intake, although the presence of an invalid single nucleotide polymorphism likely biased this estimate. Conclusion: This study advances MR methodology by clarifying how TSPS and TSRI behave with count exposures and outcomes, providing practical guidance for valid instrument selection and reliable causal inference in MR studies involving count data.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。