Serum biomarker-based early detection of pancreatic ductal adenocarcinomas with ensemble learning

基于血清生物标志物的集成学习胰腺导管腺癌早期检测

阅读:10
作者:Nuno R Nené, Alexander Ney, Tatiana Nazarenko, Oleg Blyuss, Harvey E Johnston, Harry J Whitwell, Eva Sedlak, Aleksandra Gentry-Maharaj, Sophia Apostolidou, Eithne Costello, William Greenhalf, Ian Jacobs, Usha Menon, Justin Hsuan, Stephen P Pereira, Alexey Zaikin, John F Timms

Background

Earlier detection of pancreatic ductal adenocarcinoma (PDAC) is key to improving patient outcomes, as it is mostly detected at advanced stages which are associated with poor survival. Developing non-invasive blood tests for early detection would be an important breakthrough.

Conclusions

The ensemble modelling strategy explored here outperforms considerably biomarker combinations cited in the literature. Further developments in the selection of classifiers balancing performance and heterogeneity should further enhance the predictive capacity of the method.

Methods

The primary objective of the work presented here is to use a dataset that is prospectively collected, to quantify a set of cancer-associated proteins and construct multi-marker models with the capacity to predict PDAC years before diagnosis. The data used is part of a nested case-control study within the UK Collaborative Trial of Ovarian Cancer Screening and is comprised of 218 samples, collected from a total of 143 post-menopausal women who were diagnosed with pancreatic cancer within 70 months after sample collection, and 249 matched non-cancer controls. We develop a stacked ensemble modelling technique to achieve robustness in predictions and, therefore, improve performance in newly collected datasets.

Results

Here we show that with ensemble learning we can predict PDAC status with an AUC of 0.91 (95% CI 0.75-1.0), sensitivity of 92% (95% CI 0.54-1.0) at 90% specificity, up to 1 year prior to diagnosis, and at an AUC of 0.85 (95% CI 0.74-0.93) up to 2 years prior to diagnosis (sensitivity of 61%, 95% CI 0.17-0.83, at 90% specificity). Conclusions: The ensemble modelling strategy explored here outperforms considerably biomarker combinations cited in the literature. Further developments in the selection of classifiers balancing performance and heterogeneity should further enhance the predictive capacity of the method.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。