Comparative Analysis of Machine-Learning Algorithms for Accurate Diagnosis of Lung Diseases Using Chest X-ray Images: A Study on Balanced and Unbalanced Data on Segmented and Unsegmented Images

基于胸部X光图像的机器学习算法在肺部疾病精准诊断中的比较分析:基于分割图像和未分割图像的平衡数据与非平衡数据研究

阅读:1

Abstract

The study focused on the accurate diagnosis of lung diseases, considering the high number of lung disease-related deaths in the world. Chest x-ray images were used as they are a cost-effective and widely available diagnostic tool. Eight different machine learning algorithms were evaluated: Logistic Regression, Naive Bayes, k-Nearest Neighbors (kNN), Decision Tree, Random Forest, Support Vector Machine (SVM), Ridge, and Least Absolute Shrinkage and Selection Operator (LASSO). The study evaluated balanced and imbalanced datasets and looked at both segmented and unsegmented chest x-ray images. COVID-19, pneumonia, normal, and others were the four classes that were used in the investigation. Prior to attribute reduction, Decision Tree and Random Forest performed well on the balanced dataset, obtaining 74% test accuracy and 92% training accuracy. SVM functioned well as well, obtaining a 74% test accuracy. Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) are two attribute reduction approaches that were applied. Decision Trees and Random Forests were able to attain the maximum training accuracy of 92%, while SVM was able to retain a test accuracy of 74% after attribute reduction. The findings also imply that some algorithms' performance may be enhanced by attribute reduction methods like PCA and LDA. For imbalanced data, Random Forest and SVM perform the best in terms of balanced accuracy of 80%. However, further research and experimentation may be needed to optimize the models and explore other potential algorithms or techniques.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。