An empirical assessment of differential privacy in real-world observational data: a case-control study of asthma exacerbation in UK Biobank linked with electronic health records

对真实世界观察数据中差分隐私的实证评估:一项基于英国生物银行哮喘急性发作病例对照研究,该研究与电子健康记录相关联

阅读:2

Abstract

OBJECTIVES: Electronic health records (EHRs) provide substantial resources for observational studies, yet present significant challenges in safeguarding patient privacy while maintaining research quality. Differential privacy (DP) offers a quantifiable privacy guarantee; however, its impact on observational studies remains underexplored. We empirically evaluated the effects of DP across varying values of its privacy parameter, epsilon, on case-control analysis outcomes using EHR data. This study aims to inform DP parameter selection and examines the influence of study characteristics on differentially private observational studies. MATERIALS AND METHODS: We assessed the effects of DP on a case-control study of 1-year asthma exacerbations, including 22 165 participants with a history of asthma from UK Biobank linked to EHR data. Odds ratios (ORs) for sociodemographic factors and comorbidities were analyzed using adjusted and propensity score-matched models across epsilon values. RESULTS: DP influenced the magnitude, direction, and statistical significance of ORs, occasionally resembling patterns of misclassification, residual confounding, and false-positive bias. Rare and imbalanced covariates showed greater OR variability, especially in matched studies. Epsilons smaller than ln(2) led to noticeable OR fluctuations. DISCUSSION: The impact of DP on ORs and selection of an optimal epsilon depends on sample size, covariate prevalence, confounders, case-to-control ratios in propensity score matching, mitigation of random seed p-hacking, and trust models. CONCLUSION: The effects of DP on ORs are highly context-dependent. In this study, epsilon values below ln(2) led to unstable ORs across random seeds. Averaging results or using predetermined seeds may help reduce variability and mitigate p-hacking.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。