Developing a robust two-step machine learning multiclassification pipeline to predict primary site in head and neck carcinoma from lymph nodes

开发一种稳健的两步机器学习多分类流程,用于根据淋巴结预测头颈部癌的原发部位

阅读:1

Abstract

This study aimed to develop a robust multiclassification pipeline to determine the primary tumor location in patients with head and neck carcinoma of unknown primary using radiomics and machine learning techniques. The dataset included 400 head and neck cancer patients with primary tumor in oropharynx, OPC (n = 162), nasopharynx, NPC (n = 137), oral cavity, OC (n = 63), larynx and hypopharynx, HL (n = 38). Two radiomic-based multiclassification pipelines (P1 and P2) were developed. P1 consisted in a direct identification of the primary sites, whereas P2 was based on a two-step approach: in the first step, the number of classes was reduced by merging the two minority classes which were reclassified in the second step. Diverse correlation thresholds (0.75, 0.80, 0.85), feature selection methods (sequential forwards/backwards selection, sequential floating forward selection, neighborhood component analysis and minimum redundancy maximum relevance), and classification models (neural network, decision tree, naïve Bayes, bagged trees and support vector machine) were assessed. P2 outperformed P1, with the best results obtained with the support vector machine classifier including radiomic and clinical features (accuracies of 75.3 % (HL), 75.4 % (OC), 71.3 % (OPC), 92.9 % (NPC)). These results indicate that the two-step multiclassification pipeline integrating radiomics and clinical information is a promising approach to predict the tumor site of unknown primary.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。