Multiclass leukemia cell classification using hybrid deep learning and machine learning with CNN-based feature extraction

基于混合深度学习和机器学习以及基于卷积神经网络特征提取的多类别白血病细胞分类

阅读:2

Abstract

Leukemia is the most prevalent form of blood cancer, affecting individuals across all age groups. Early and accurate diagnosis is crucial for effective treatment and improved clinical outcomes. Peripheral blood smear analysis, a key non-invasive diagnostic tool, often suffers from subjective interpretation, inter-observer variability, and a lack of readily available expertise. Although deep learning approaches, particularly Convolutional Neural Networks (CNNs), have demonstrated exceptional performance in binary classification tasks, multiclass classification of leukemia subtypes remains challenging due to limited data availability and morphological similarities between subtypes. This study presents a novel hybrid methodology that combines pre-trained CNN architectures, including VGG16, InceptionV3, and ResNet50, with advanced classification models such as Random Forest (RF), Support Vector Machine (SVM), Extreme Gradient Boosting (XGBoost), and the deep learning-based Multi-Layer Perceptron (MLP). The method leverages publicly available datasets, the Acute Lymphoblastic Leukemia Image Database (ALL-IDB) and the Munich AML Morphology Dataset, to classify healthy cells, lymphoblasts, and myeloblasts. Pre-trained CNNs are employed for feature extraction, while the classifiers refine the predictions for improved accuracy. The proposed approach demonstrated exceptional performance, with the InceptionV3 + SVM combination achieving the highest accuracy of 88%, followed closely by VGG16 + XGBoost at 87%. MLP-based models also achieved strong results, effectively capturing non-linear patterns in the data. In contrast, ResNet50 exhibited limitations, likely due to overfitting caused by the small dataset. The novelty of this work lies in the integration of pre-trained deep learning architectures with hybrid classification techniques, enabling robust multiclass classification in data-constrained scenarios. This innovative approach offers a scalable and precise diagnostic tool, improving the speed and reliability of leukemia subtype identification and providing significant potential to enhance clinical decision-making and patient care.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。