Precise leaf damage detection across diverse species and environments via a large-scale vision model

利用大规模视觉模型,对不同物种和环境下的叶片损伤进行精确检测

阅读:3

Abstract

Precise detection of crop leaf damage is essential for real-time plant health monitoring and yield estimation. However, conventional deep learning models often exhibit poor generalization when deployed across varying species and complex, unstructured field environments. To address these limitations, we propose a new modeling paradigm that shifts from traditional task-specific training to foundation model adaptation. Specifically, we introduce a novel architecture integrating the DinoV3 foundation model with a Unet framework to achieve robust leaf lesion segmentation. By incorporating a Spatial Prior Module (SPM) and a Projection Module, our approach effectively bridges the gap between general-purpose pre-training and domain-specific requirements. Experimental results on coffee and black gram datasets demonstrate that this paradigm consistently outperforms standard networks, including Unet, Unet++, and SwinUnet. On the coffee leaf dataset, the proposed model achieves an Intersection over Union (IoU) of 78.31% and a Pixel Accuracy of 88.00%, surpassing the baseline Unet by over 10.5% in IoU. Remarkably, the architecture reduces inference time by approximately 93.6% (from 63.41s to 4.07s), proving that high-parameter foundation models can be adapted for extreme computational efficiency in agricultural scenarios. To further validate scalability, we conduct additional experiments on a larger dataset, AMG (HS) . The proposed paradigm achieves the best overall detection performance while maintaining superior computational efficiency, confirming its robustness under increased data scale. Interpretability analysis reveals that the foundation model backbone effectively captures high-level semantic features of lesions, providing a clear explanation for its superior performance and cross-domain reliability. This research establishes a scalable, high-performance paradigm for intelligent crop protection, demonstrating that coupling customized encoders with foundation models is a superior strategy for cross-domain agricultural tasks.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。