Paddy pest image segmentation based on multiscale attention fusion VM-UNet

基于多尺度注意力融合的稻田病虫害图像分割 VM-UNet

阅读:1

Abstract

Precise paddy pest image segmentation (PPIS) in the real-time natural environments is an important and challenging research. Convolutional Neural Networks (CNNs) and Transformers are the most popular architectures for image segmentation, but they usually have limitations in modeling global dependencies and quadratic computational complexity, respectively. A multiscale attention fusion VM-UNet (MSAF-VMUNet) for PPIS is constructed. It integrates the long-range dependencies modeling ability of Visual State Space Model (VSS) and the precise positioning capability of U-Net with low computational complexity. In the model, multiscale VSS (MSVSS) block is used to capture the long-range contextual information, and improved attention fusion (IAF) module is designed for multi-level feature learning between Encoder and Decoder. Attention VSS module is introduced in the bottleneck layer to enable the model to adaptively emphasize key features and suppress redundant information. Compared with VM-UNet, MSAF-VMUNet can effectively model global-local and context relationships at the scale layer, and improve the detection performance of various pests in size and shape without increasing computational complexity. The experimental results on the paddy pest subset of the public IP102 dataset validate that MSAF-VMUNet can effectively address the key challenges in field PPIS, including small pest detection, occlusion and noise handling, and preprocessing requirements, and the PPIS presion is 79.17%, which are 15.51% and 3.39% higher than those of the traditional U-Net and the recent VM-UNet, respectively. It provides an effective and reliable solution for pest control detection system in smart agriculture.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。