Value of Machine Learning Models for Cell-Free DNA-Based Multi-Cancer Early Detection: A Systematic Review and Meta-Analysis

Journal:	Technology in Cancer Research & Treatment	Impact factor:	2.800
Year:	2026	Citation:	2026 Jan-Dec;25:15330338261425328
doi:	10.1177/15330338261425328	Research area:	Cell biology, oncology

Abstract

Introduction: Machine learning (ML)-based analysis of cell-free DNA (cfDNA) has emerged as a promising strategy for multi-cancer early detection (MCED). However, reported diagnostic performance varies widely across studies, and many estimates are derived from training or enriched cohorts, limiting their relevance to independent validation and real-world settings.

Methods: We conducted a systematic review and diagnostic accuracy meta-analysis of ML-based cfDNA assays for MCED. Four databases (PubMed, Embase, Web of Science, and the Cochrane Library) were searched from inception to February 2, 2025. Only independent validation or testing datasets were included; all training datasets were excluded. Pooled sensitivity, specificity, diagnostic odds ratio (DOR), and summary receiver operating characteristic (SROC) curves were estimated using a bivariate random-effects model. Subgroup analyses and meta-regression were performed to explore sources of heterogeneity.

Results: Thirteen studies comprising 23 independent datasets and 14,892 participants were included. The pooled sensitivity was 0.78 (95% CI: 0.66-0.87), and the pooled specificity was 0.96 (95% CI: 0.90-0.98). The summary area under the curve (AUC) was 0.94, with a DOR of 76.6. Substantial between-study heterogeneity was observed (I² > 90%), with geographic region, sample size, and cfDNA biomarker type identified as major contributing factors.

Conclusion: ML-based cfDNA assays demonstrate consistently high specificity and moderate-to-high sensitivity across independent validation datasets, supporting their potential role in multi-cancer early detection. However, diagnostic performance is highly context dependent and strongly influenced by study design, population characteristics, and analytical choices. These findings highlight the need for large-scale, prospective, population-based validation before widespread clinical implementation.
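
To make the reported accuracy figures more concrete, the following minimal sketch derives likelihood ratios and a diagnostic odds ratio from a single sensitivity/specificity pair, and shows how positive predictive value depends on disease prevalence. It is an illustration only, not the authors' bivariate random-effects method: the function names are hypothetical, and the 1% prevalence is an assumed screening-population value that does not come from the abstract.

```python
def diagnostic_summary(sensitivity: float, specificity: float) -> dict:
    """Derive LR+, LR-, and DOR from one sensitivity/specificity pair."""
    if not (0.0 < sensitivity < 1.0 and 0.0 < specificity < 1.0):
        raise ValueError("sensitivity and specificity must lie strictly between 0 and 1")
    lr_pos = sensitivity / (1.0 - specificity)   # positive likelihood ratio
    lr_neg = (1.0 - sensitivity) / specificity   # negative likelihood ratio
    return {"lr_pos": lr_pos, "lr_neg": lr_neg, "dor": lr_pos / lr_neg}


def positive_predictive_value(sensitivity: float, specificity: float,
                              prevalence: float) -> float:
    """Bayes' rule: fraction of positive results that are true positives."""
    true_pos = sensitivity * prevalence
    false_pos = (1.0 - specificity) * (1.0 - prevalence)
    return true_pos / (true_pos + false_pos)


# Pooled point estimates quoted in the abstract.
POOLED_SENSITIVITY = 0.78
POOLED_SPECIFICITY = 0.96

# Assumed prevalence for illustration only (not reported in the abstract).
ASSUMED_PREVALENCE = 0.01

summary = diagnostic_summary(POOLED_SENSITIVITY, POOLED_SPECIFICITY)
ppv = positive_predictive_value(POOLED_SENSITIVITY, POOLED_SPECIFICITY,
                                ASSUMED_PREVALENCE)

print(f"LR+ = {summary['lr_pos']:.2f}")
print(f"LR- = {summary['lr_neg']:.3f}")
print(f"Naive DOR = {summary['dor']:.1f}")
print(f"PPV at {ASSUMED_PREVALENCE:.0%} prevalence = {ppv:.1%}")
```

Plugging in the pooled point estimates gives a naive DOR of roughly 85, somewhat above the reported 76.6. A gap of this kind is not surprising, since a pooled DOR is estimated from the meta-analytic model rather than computed from the summary sensitivity and specificity. Under the assumed 1% prevalence, the positive predictive value comes out at about 16%, which illustrates why estimates from enriched cohorts may not carry over to population screening.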
