Identification of tobacco leaf diseases using hyperspectral imaging and machine learning with SHAP interpretability analysis

利用高光谱成像和机器学习技术结合SHAP可解释性分析识别烟叶病害

阅读:3

Abstract

Tobacco leaf diseases significantly affect yield and quality, underscoring the need for rapid and non-destructive diagnostic tools. Although hyperspectral imaging (HSI) has been applied in tobacco pathology, most existing studies focus on single diseases and lack generalized, interpretable frameworks for multi-class identification. In this study, hyperspectral images of healthy leaves and four major diseases-brown spot, wildfire, Tobacco Mosaic Virus (TMV), and Potato virus Y (PVY)-were collected to construct a balanced, leaf-independent dataset. Pixels were grouped by leaf ID, and the entire dataset was strictly partitioned at the leaf level to prevent pixel-level data leakage and ensure generalization to unseen leaves. Multiple preprocessing techniques, wavelength-selection methods, and machine-learning classifiers were systematically compared. A compact ANN model integrating Savitzky-Golay preprocessing and SPA-based wavelength selection achieved the best overall performance while requiring only a small number of informative wavelengths. A Transformer model provided slightly stronger predictive capacity but depended on full-spectrum inputs and substantially higher computational cost. Pixel-level predictions enabled lesion-area-based severity estimation for the two leaf-spot diseases. SHAP analysis highlighted physiologically meaningful spectral regions associated with pigment absorption and structural variation. Overall, this study presents an efficient and interpretable HSI framework for multi-disease tobacco diagnosis, supporting the development of practical hyperspectral or multispectral systems.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。