Privacy protection is a core principle of genomic, but not proteomic, research. We identified independent single nucleotide polymorphism (SNP) protein quantitative trait loci (pQTL) from COPDGene and the Jackson Heart Study (JHS), calculated continuous protein level genotype probabilities, and then applied a naïve Bayesian approach to link SomaScan 1.3K proteomes to genomes for 2812 independent subjects from COPDGene, JHS, the SubPopulations and InteRmediate Outcome Measures In COPD Study (SPIROMICS), and the Multi-Ethnic Study of Atherosclerosis (MESA). We linked 90-95% of proteomes to their correct genome, and for 95-99% of proteomes the correct genome was among the 1% most likely links. Linking accuracy in subjects with African ancestry was lower (~60%) unless training included diverse subjects. With larger profiling (SomaScan 5K) in the Atherosclerosis Risk in Communities (ARIC) study, correct identification was >99%, even in mixed-ancestry populations. We also linked proteomes to proteomes and used the proteome alone to determine features such as sex, ancestry, and first-degree relatives. When serial proteomes are available, the linking algorithm can be used to identify and correct mislabeled samples. This work also demonstrates the importance of including diverse populations in omics research and shows that large proteomic datasets (>1000 proteins) can be accurately linked to a specific genome through pQTL knowledge and should not be considered unidentifiable.
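The linking idea in the abstract can be illustrated with a toy simulation. The sketch below is not the authors' implementation: it assumes an additive Gaussian model for each protein given its pQTL genotype, treats proteins as independent (the naïve Bayes assumption), uses a uniform prior over candidate genomes, and runs on simulated data with an unrealistically strong effect size. All function names and parameter values are hypothetical.

```python
import numpy as np


def simulate_cohort(n_subjects, n_pqtl, effect=1.0, seed=0):
    """Simulate toy genotypes (0/1/2 allele counts) and protein levels."""
    rng = np.random.default_rng(seed)
    maf = rng.uniform(0.1, 0.5, size=n_pqtl)  # assumed allele frequencies
    genotypes = rng.binomial(2, maf, size=(n_subjects, n_pqtl))
    # Assumed model: level = effect * genotype + standard normal noise.
    levels = effect * genotypes + rng.normal(size=genotypes.shape)
    return genotypes, levels


def fit_pqtl_models(genotypes, levels):
    """Per-pQTL least-squares fit of level = intercept + slope * genotype."""
    g = genotypes.astype(float)
    g_centered = g - g.mean(axis=0)
    l_centered = levels - levels.mean(axis=0)
    ss_g = (g_centered ** 2).sum(axis=0)
    # Guard against a monomorphic SNP in the training set (slope becomes 0).
    slope = (g_centered * l_centered).sum(axis=0) / np.where(ss_g > 0, ss_g, 1.0)
    intercept = levels.mean(axis=0) - slope * g.mean(axis=0)
    residual_sd = (levels - (intercept + slope * g)).std(axis=0, ddof=2)
    return intercept, slope, residual_sd


def link_proteomes(levels, candidate_genotypes, intercept, slope, residual_sd):
    """Return the most likely candidate genome for each proteome.

    With a uniform prior, the posterior over candidates is proportional to
    the likelihood, so the argmax of the summed per-protein Gaussian
    log-likelihoods is the naive Bayes choice.
    """
    expected = intercept + slope * candidate_genotypes  # (genomes, proteins)
    z = (levels[:, None, :] - expected[None, :, :]) / residual_sd
    # Normalizing constants are identical for every candidate and are dropped.
    log_lik = -0.5 * (z ** 2).sum(axis=2)  # (proteomes, genomes)
    return log_lik.argmax(axis=1), log_lik


if __name__ == "__main__":
    genotypes, levels = simulate_cohort(n_subjects=300, n_pqtl=100)
    params = fit_pqtl_models(genotypes[:200], levels[:200])  # training subjects
    best, _ = link_proteomes(levels[200:], genotypes[200:], *params)
    accuracy = (best == np.arange(100)).mean()
    print(f"proteomes linked to the correct genome: {accuracy:.0%}")
```

Ranking the columns of `log_lik` for each proteome, rather than taking only the argmax, would give the kind of "most likely links" shortlist the abstract mentions.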
Large scale proteomic studies create novel privacy considerations.
Authors: Hill Andrew C, Guo Claire, Litkowski Elizabeth M, Manichaikul Ani W, Yu Bing, Konigsberg Iain R, Gorbet Betty A, Lange Leslie A, Pratte Katherine A, Kechris Katerina J, DeCamp Matthew, Coors Marilyn, Ortega Victor E, Rich Stephen S, Rotter Jerome I, Gerszten Robert E, Clish Clary B, Curtis Jeffrey L, Hu Xiaowei, Obeidat Ma-En, Morris Melody, Loureiro Joseph, Ngo Debby, O'Neal Wanda K, Meyers Deborah A, Bleecker Eugene R, Hobbs Brian D, Cho Michael H, Banaei-Kashani Farnoush, Bowler Russell P
| Journal: | Scientific Reports | Impact factor: | 3.900 |
| Year: | 2023 | Citation: | 2023 Jun 7; 13(1):9254 |
| DOI: | 10.1038/s41598-023-34866-6 | | |
