Attention-guided hybrid learning for accurate defect classification in manufacturing environments

注意力引导混合学习在制造环境中实现精确缺陷分类

阅读:2

Abstract

Industrial defect classification remains a crucial challenge due to the visual complexity, rarity, and diversity of defect types encountered in manufacturing. In this paper, we present a hybrid deep learning framework that integrates YOLOv11 and EfficientNet-B7 to perform robust multi-class defect classification. Our model combines the semantic richness of YOLO's spatial features with the fine-grained representation power of EfficientNet, further enhanced by a Convolutional Block Attention Module (CBAM) and a lightweight Feature Pyramid Network (FPN) for attention-guided multi-scale refinement. Unlike conventional anomaly detectors or class-specific models, our framework supports unified classification across diverse object categories and defect types. We evaluate the proposed model on two datasets: the MVTec-FS benchmark, which includes 46 defect types across 14 industrial categories, and our proprietary Window dataset, which comprises three real-world defect classes under variable conditions. The model achieves a state-of-the-art accuracy of 91.90% on MVTec-FS and 96.13% on the Window dataset, outperforming existing CNN, transformer, and ensemble baselines. Ablation studies further demonstrate the incremental contribution of each module to the overall performance. These results validate our model's ability to generalize across domains and offer practical utility in industrial quality control pipelines.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。