Machine Learning-Based Morphological Classification and Diversity Analysis of Ornamental Pumpkin Seeds

基于机器学习的观赏南瓜种子形态分类与多样性分析

阅读:1

Abstract

Ornamental pumpkin (Cucurbita pepo L. var. ovifera) seeds are highly morphologically variable, and their classification is hence a complex task for the seed industry. Efficient and accurate classification is critical for agricultural production, breeding programs, and seed sorting for commerce. This study employs machine learning models-Random Forest (RF), LightGBM, and k-Nearest Neighbors (KNN)-to classify ornamental pumpkin seeds based on their morphological (mass, elongation, width, thickness) and colorimetric characteristics (L*, a*, b* values from CIELAB color space). Prior to model training, the data set was preprocessed through normalization and balancing to enhance classification performance. In this study, six different types of ornamental pumpkin seeds were used, with a total of 900 (150 each of SDE0619, SDE1020, SDE1620, SDE2621, SDE4521, and SDE7721). The classification performance of the models was evaluated using different metrics, such as Accuracy, Balanced Accuracy, Precision, Recall, F1 Score, Matthews Correlation Coefficient (MCC), and Cohen's Kappa. Among the tested models, the RF model performed best, with Accuracy of 0.959, Balanced Accuracy of 0.961, Precision (Macro) of 0.962, Recall (Macro) of 0.961, F1 Score (Macro) of 0.961, MCC of 0.951, and Cohen's Kappa of 0.951. In contrast, the worst classification performance of the tested models was with the KNN model across all the evaluation metrics. These outcomes reflect the potential of machine learning-based approaches for seed classification automation, error minimization in seed classification, and maximization of efficiency in the seed industry. The high classification performance of the Random Forest model with 95.9% accuracy and 0.951 MCC value shows that artificial intelligence-based automatic classification of ornamental pumpkin seeds according to their morphological and colorimetric characteristics can make significant contributions to the seed industry, while the integration of this approach into seed sorting and quality determination processes can enable the creation of effective breeding schemes for optimum seed selection by maximizing the accuracy of agricultural processes.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。