An ensemble of vision and swin transformers with LLM-based explanations for sugarcane leaf disease diagnosis

Abstract

Sugarcane diseases significantly reduce crop yield and quality, posing a persistent challenge to the agricultural sector. This study presents a novel ensemble framework that integrates Vision Transformer and Swin Transformer architectures for accurate sugarcane leaf disease detection. By combining global self-attention with localized window-based attention, the proposed model captures multi-scale visual features associated with diverse disease symptoms. On a large, labeled sugarcane leaf dataset, the model achieved a validation accuracy of 98.16% and a test accuracy of 97.06%, outperforming several convolutional neural network baselines. In addition, a large language model (LLM) interface serves as a post-prediction decision-support module, generating disease-specific descriptions and management suggestions based solely on the predicted disease class. The results indicate the potential of transformer-based ensemble models, combined with LLM-based advisory support, for practical decision-making in precision agriculture.
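
The abstract does not state how the two models' outputs are fused, so the following is only a minimal sketch of one common option, soft voting, followed by a prompt built from the predicted class alone. The class names, the equal weighting, the stand-in logits, and the prompt wording are all assumptions made for illustration; none of them come from the paper.

```python
import math

# Hypothetical class labels; the abstract does not list the dataset's classes.
CLASS_NAMES = ["healthy", "mosaic", "red_rot", "rust", "yellow_leaf"]


def softmax(logits):
    """Convert raw scores to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def soft_vote(vit_logits, swin_logits, vit_weight=0.5):
    """Weighted average of the two models' class probabilities.

    Equal weighting is an assumption, not the paper's stated method.
    """
    vit_probs = softmax(vit_logits)
    swin_probs = softmax(swin_logits)
    return [
        vit_weight * v + (1.0 - vit_weight) * s
        for v, s in zip(vit_probs, swin_probs)
    ]


def predict_class(vit_logits, swin_logits):
    """Return the label with the highest ensemble probability."""
    probs = soft_vote(vit_logits, swin_logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return CLASS_NAMES[best], probs[best]


def build_advisory_prompt(label):
    """Build an LLM prompt from the predicted class only (no image data)."""
    return (
        f"The classifier predicted the sugarcane leaf condition '{label}'. "
        "Describe this condition and suggest management steps."
    )


# Stand-in logits; in practice these would come from the two trained models.
vit_logits = [0.2, 2.5, 0.1, 0.4, 0.3]
swin_logits = [0.1, 1.9, 0.3, 2.1, 0.2]

label, confidence = predict_class(vit_logits, swin_logits)
prompt = build_advisory_prompt(label)
```

In this toy input the two models disagree: the Swin stand-in slightly prefers "rust", while the ViT stand-in strongly prefers "mosaic". Averaging probabilities lets the more confident model decide the ensemble label. Because the prompt is derived only from that label, any advice the LLM produces is only as reliable as the classifier's prediction.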
