Abstract
The reliable classification of medicinal plant species plays a vital role in ensuring their quality, authenticity, and safe use in healthcare. However, existing methods often face difficulties when species exhibit strong visual similarities or when datasets are imbalanced, which limits their effectiveness in practice. Although deep learning models such as ResNet18 and VGG16 have proven influential in image recognition tasks, our experiments showed that they tended to overfit, with validation losses reaching 42.99 % and test accuracy falling to 73.99 % in certain groups. To overcome these challenges, we introduce a multi-level fusion feature model that combines 3D normalized color histograms, extended uniform Local Binary Patterns (LBP with P = 24, R = 3), multi-orientation Gabor filters, and Histogram of Oriented Gradients (HOG). This approach captures a richer set of visual cues by bringing together global color statistics, detailed textures, frequency-domain patterns, and shape descriptors. We incorporate SMOTE-based synthetic augmentation to address further class imbalance, which helps balance feature distributions across categories. We employ a soft-voting ensemble of machine learning classifiers for classification and use cosine similarity metrics to capture inter-class relationships better. Tests on Indian medicinal plant datasets show that our model consistently outperforms deep learning baselines, reaching 100 % accuracy in Group 1, 95.82 % in Group 3, and over 90 % in other groups. These results suggest that the proposed model offers a more robust and computationally efficient solution for plant species classification, particularly under conditions of high inter-class similarity and dataset imbalance.•The proposed domain-specific model can be applied explicitly to Indian plant species groups exhibiting high inter-class visual similarities through a novel feature fusion strategy.•The proposed multi-level feature fusion method's innovation integrates 3D normalized color histograms, extended uniform LBP (P = 24, R = 3), multi-orientation Gabor filters, and HOG features to capture the color, texture, and shape characteristics.•The proposed work offers a scalable ensemble framework for inter-class similarity analysis by combining SMOTE-based class balancing, feature normalization, and a soft-voting ensemble of diverse classifiers that support biodiversity and ecological studies.