A multi-scale attention-based Swin transformer model for medical images segmentation

一种基于多尺度注意力机制的Swin Transformer模型用于医学图像分割

阅读:1

Abstract

Medical image segmentation is crucial in accurately diagnosing diseases and assisting physicians in examining relevant areas. Therefore, there is a pressing need for an artificial intelligence-based model that can facilitate the diagnostic process and reduce errors. Existing networks often have high parameters, high gigaflops, and low accuracy. This research addresses this gap by proposing a transformer-based architecture. We developed a Swin transformer as an encoder to improve segmentation accuracy in medical images by leveraging deep feature extraction. This model can extract key image features more accurately by employing sliding windows and an attention mechanism. The overarching goal of this research is to design an optimized architecture for medical image segmentation that maintains high accuracy while reducing the number of network parameters and minimizing computational costs. In the decoder section, we designed the dynamic feature fusion block (DFFB) to enhance the extracted features, enabling the extraction of multi-scale features. This capability enables the model to analyze the structural information of medical images at various levels, resulting in improved performance in segmenting complex regions. We also employed the dynamic attention enhancement block to further enhance the features extracted from the DFFB output. This block utilizes spatial and channel attention mechanisms to emphasize key areas in the images, thereby enhancing the model's overall accuracy. The proposed model achieved segmentation performance across three medical datasets, obtaining a mean intersection over union (mIoU) of 0.9125 and a Dice score of 0.9542 on GlaS, an mIoU of 0.9174 and a Dice score of 0.9569 on PH2, and an mIoU of 0.9085 and a Dice score of 0.9521 on Kvasir-SEG. The experiments illustrate that the proposed model outperforms previous methods, demonstrating its potential as an effective tool in medical image segmentation.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。