Automated grading of embroidery assignments: a multi-region deep learning framework with ResNet-50

刺绣作业自动评分:基于 ResNet-50 的多区域深度学习框架

阅读:2

Abstract

BACKGROUND: Manual grading of dental students’ embroidery assignments is not only labor-intensive but also subjective. To address these limitations, our study proposes an automated grading model based on ResNet-50 architecture enhanced with a multi-region aggregation mechanism. This approach aims to standardize the grading process, improve fairness and efficiency in assessment. METHODS: A total of 381 embroidery assignment images were collected from the 2020–2023 student cohorts. The 2022 cohort was designated as an external test set to assess model generalization with different data distributions. We proposed a multi-region aggregation mechanism based on ResNet-50 and compared two aggregation strategies: multi-head attention (MHA) aggregation and average weighting (AW) aggregation. VGG-16, DenseNet-121, ViT, and ResNet-50 were considered as baseline models. All models were trained using 5-fold cross-validation, employing a weighted CrossEntropyLoss to address class imbalance, with evaluation metrics including accuracy, precision, recall, and F1 score. RESULTS: The ResNet-50 AW model achieved the highest test accuracy of 80% on the test set, while the ViT and the VGG-16 models achieved 75%, second to ResNet-50 AW. Although models’ performance degraded on the external test set, ResNet-50 AW maintained the highest accuracy of 64% and reduced misclassifications of grade B and C samples. Despite excelling on the validation set, ResNet-50 MHA showed similar performance to ResNet-50 on the test set. ViT and VGG-16 achieved higher accuracy for grade A on both the test set and the external test set. CONCLUSION: The ResNet-50 AW model highlights the potential of deep learning methods to automate the grading of artistic assignments via a multi-region aggregation mechanism. Further validation of the model’s generalization is needed. Future work should improve dataset quality and diversity and enhance system interpretability to refine the grading process for greater accuracy and transparency.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。