Abstract
Deep learning-based computer vision technology significantly improves the accuracy and efficiency of crop disease detection. However, the scarcity of crop disease images leads to insufficient training data, limiting the accuracy of disease recognition and the generalization ability of deep learning models. Therefore, increasing the number and diversity of high-quality disease images is crucial for enhancing disease monitoring performance. We design a frequency-domain and wavelet image augmentation network with a dual discriminator structure (FHWD). The first discriminator distinguishes between real and generated images, while the second high-frequency discriminator is specifically used to distinguish between the high-frequency components of both. High-frequency details play a crucial role in the sharpness, texture, and fine-grained structures of an image, which are essential for realistic image generation. During training, we combine the proposed wavelet loss and Fast Fourier Transform loss functions. These loss functions guide the model to focus on image details through multi-band constraints and frequency domain transformation, improving the authenticity of lesions and textures, thereby enhancing the visual quality of the generated images. We compare the generation performance of different models on ten crop diseases from the PlantVillage dataset. The experimental results show that the images generated by FHWD contain more realistic leaf disease lesions, with higher image quality that better aligns with human visual perception. Additionally, in classification tasks involving nine types of tomato leaf diseases from the PlantVillage dataset, FHWD-enhanced data improve classification accuracy by an average of 7.25% for VGG16, GoogleNet, and ResNet18 models.Our results show that FHWD is an effective image augmentation tool that effectively addresses the scarcity of crop disease images and provides more diverse and enriched training data for disease recognition models.