A swin transformer-based hybrid reconstruction discriminative network for image anomaly detection

一种基于Swin Transformer的混合重构判别网络用于图像异常检测

阅读:1

Abstract

Industrial anomaly detection algorithms based on Convolutional Neural Networks (CNN) often struggle with identifying small anomaly regions and maintaining robust performance in noisy industrial environments. To address these limitations, this paper proposes the Swin Transformer-Based Hybrid Reconstruction Discriminative Network (SRDAD), which combines the global context modeling capabilities of Swin Transformer with complementary reconstruction and discrimination approaches. Our approach introduces three key contributions: a natural anomaly image generation module that produces diverse simulated anomalies resembling real-world defects; a Swin-Unet based reconstruction subnetwork with enhanced residual and pooling modules for accurate normal image reconstruction, utilizing hierarchical window attention mechanisms, and an anomaly contrast discrimination subnetwork based on convolutional Unet that enables end-to-end detection and localization through contrastive learning. This hybrid approach combines reconstruction and discrimination paradigms to improve anomaly detection performance. Experimental results on the industrial dataset MVTec AD demonstrate that SRDAD achieves competitive performance, with improvements of 0.6% in detection accuracy and 0.7% in localization precision. The method demonstrates improved performance in detecting small anomalies and maintaining performance in noisy environments, highlighting its potential for industrial applications.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。