Bioinformatic approach to the genetics of preeclampsia

Abstract

OBJECTIVE: To identify candidate genes and genetic variants for preeclampsia using a bioinformatic approach to extract and organize genes and variants from the published literature.

METHODS: Semantic data-mining and natural language processing were used to identify articles from the published literature meeting criteria for potential association with preeclampsia. Articles were manually reviewed by trained curators. Cluster analysis was used to aggregate the extracted genes into gene sets associated with preeclampsia or severe preeclampsia, early or late preeclampsia, maternal or fetal tissue sources, and concurrent conditions (ie, fetal growth restriction, gestational hypertension, or hemolysis, elevated liver enzymes, and low platelet count [HELLP]). Gene ontology was used to organize this large group of genes into ontology groups.

RESULTS: From more than 22 million records in PubMed, with 28,000 articles on preeclampsia, our data-mining tool identified 2,300 articles with potential genetic associations with preeclampsia-related phenotypes. After curation, 729 articles were "accepted" that contained "statistically significant" associations with 535 genes. We saw distinct segregation of these genes by severity and timing of preeclampsia, by maternal or fetal source, and with associated conditions (eg, gestational hypertension, fetal growth restriction, or HELLP syndrome).

CONCLUSION: The gene sets and ontology groups identified through our systematic literature curation indicate that preeclampsia represents several distinct phenotypes with distinct and overlapping maternal and fetal genetic contributions.

LEVEL OF EVIDENCE: III.
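The abstract does not describe the clustering procedure in detail, so the following is only a minimal sketch of the general idea of grouping curated gene-phenotype associations into gene sets and measuring how much those sets overlap. It is not the authors' method: the record format, the placeholder gene names (`GENE_A`, `GENE_B`, `GENE_C`), the phenotype labels, and the choice of Jaccard similarity as the overlap measure are all assumptions made for illustration, and the values are not results from the study.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical curated records: (gene, phenotype label).
# Gene names are placeholders, not genes reported in the study.
records = [
    ("GENE_A", "severe"),
    ("GENE_A", "early"),
    ("GENE_B", "severe"),
    ("GENE_C", "late"),
    ("GENE_C", "HELLP"),
    ("GENE_B", "HELLP"),
]


def build_gene_sets(records):
    """Group genes into one set per phenotype label."""
    gene_sets = defaultdict(set)
    for gene, phenotype in records:
        gene_sets[phenotype].add(gene)
    return dict(gene_sets)


def jaccard(a, b):
    """Jaccard similarity: shared genes divided by all genes in either set."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


gene_sets = build_gene_sets(records)

# Pairwise overlap between every pair of phenotype gene sets.
overlaps = {
    (p, q): jaccard(gene_sets[p], gene_sets[q])
    for p, q in combinations(sorted(gene_sets), 2)
}

for (p, q), score in overlaps.items():
    print(f"{p} vs {q}: {score:.2f}")
```

In this toy setting, a score of 0 between two phenotype sets corresponds to the "distinct" contributions mentioned in the conclusion, and a score above 0 to the "overlapping" ones. A pairwise overlap matrix like this could also serve as input to a clustering step, though which algorithm the authors used is not stated in the abstract.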
