Abstract
Varieties show their unique characteristics in morphology, growth, and fruits. Tomato maturity is related to multiple dimensional characteristics including color, texture, smell, etc. An effective classification method of tomato variety and maturity is crucial for evaluating its growth and yield. However, due to the complex growth environment, some problems such as leaf occlusion and fruit shaded by each other make it difficult to accurately and efficiently identify them. To solve these problems, this study innovatively proposes a simultaneous detection model on tomato variety and maturity based on improved YOLOv8n, with the combination of frequency-adaptive dilated convolution (FADC) feature extraction module and the high-level screening-feature path aggregation network (HSPAN) with the aim of local and global feature fusion by the channel attention module and feature selection fusion mechanism. In addition, we use the Powerful-IoU (PIoU) loss function to replace the original Complete IoU (CIoU) to enhance the accuracy of bounding boxes. We also introduce a dynamic detection head as the final output of the model, which can adaptively adjust the focus of feature extraction according to the color and size of tomato fruits, thereby improving the recognition accuracy. Experimental results show that our model with better global perception capability achieves the highest detection accuracy and lower computation complexity among the comparative models.