K-Means Clustering and Classification of Breast Cancer Images Using Histogram of Oriented Gradients Features and Convolutional Neural Network Models: Diagnostic Image Analysis Study

基于方向梯度直方图特征和卷积神经网络模型的乳腺癌图像K均值聚类与分类:诊断图像分析研究

阅读:1

Abstract

BACKGROUND: Breast cancer has proven to be the most common type of cancer among females around the world. However, mortality rates can be reduced if it is diagnosed at the initial stages. Interpretation made by an expert is required by conventional diagnostic tools such as biopsies and mammograms, and this interpretation can be erroneous. Artificial intelligence has increasingly been used to detect and classify breast cancer. Yet, the acquisition of impressive reliability and accuracy continues to be challenging with these automated systems. OBJECTIVE: This study aimed to develop an innovative hybrid technique for the classification of breast cancer images involving unsupervised analysis by K-means clustering, feature extraction using Histogram of Oriented Gradients (HOG), and classification of images through a convolutional neural network (CNN). METHODS: This study used a dataset of 2788 breast cancer images categorized into benign (n=1480) and malignant (n=1308) classes. The proposed hybrid method included three stages: (1) unsupervised clustering using K-means to group visually similar features; (2) feature extraction using Histogram of Oriented Gradients (HOG) to capture texture and shape patterns; and (3) classification using a CNN trained on the extracted features. The model's performance was evaluated using standard metrics such as accuracy, precision, recall, and F1-score. RESULTS: The CNN achieved a classification accuracy of 98%, with precision, recall, and F1-score values of 0.98 for both benign and malignant cases. K-means clustering revealed distinct groups corresponding to benign and malignant tumors, indicating natural separability in the image data. CONCLUSIONS: The combination of HOG feature extraction and CNN-based classification demonstrated high performance in breast cancer detection. The model demonstrates potential utility for automated diagnosis, with possible clinical applications to assist radiologists in identifying malignant tumors more efficiently. Future research will explore additional imaging modalities and real-world clinical testing.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。