Comparative Analysis of Machine Learning Methods for Breast Cancer Classification in Genetic Sequences

基于机器学习的乳腺癌基因序列分类方法比较分析

阅读:1

Abstract

Breast cancer is the leading cancer in women, which accounts for millions of deaths worldwide. Early and accurate detection, prognosis, cure, and prevention of breast cancer is a major challenge to society. Hence, a precise and reliable system is vital for the classification of cancerous sequences. Machine learning classifiers contribute much to the process of early prediction and diagnosis of cancer. In this paper, a comparative study of four machine learning classifiers such as random forest, decision tree, AdaBoost, and gradient boosting is implemented for the classification of a benign and malignant tumor. To derive the most efficient machine learning model, NCBI datasets are utilized. Performance evaluation is conducted, and all four classifiers are compared based on the results. The aim of the work is to derive the most efficient machine-learning model for the diagnosis of breast cancer. It was observed that gradient boosting outperformed all other models and achieved a classification accuracy of 95.82%.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。