Biomedical Literature Mining for Repurposing Laboratory Tests

生物医学文献挖掘在实验室检测方法再利用中的应用

阅读:1

Abstract

Epidemiological studies identifying biological markers of disease state are valuable, but can be time-consuming, expensive, and require extensive intuition and expertise. Furthermore, not all hypothesized markers will be borne out in a study, suggesting that high-quality initial hypotheses are crucial. In this chapter, we describe a high-throughput pipeline to produce a ranked list of high-quality hypothesized biomarkers for diseases. We review an example use of this approach to generate a large number of candidate disease biomarker hypotheses derived from machine learning models, filter and rank them according to their potential novelty using text mining, and corroborate the most promising hypotheses with further statistical modeling. The example use of the pipeline uses a large electronic health record dataset and the PubMed corpus, to find several promising hypothesized laboratory tests with previously undocumented correlations to particular diseases.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。