Improved food recognition using a refined ResNet50 architecture with improved fully connected layers

使用改进的 ResNet50 架构和改进的全连接层来提升食物识别能力。

阅读:1

Abstract

Food consumption has significant effects on human health, particularly in relation to quality control, quantity, freshness, and color. This study focuses on identifying food across 16 categories, divided into breakfast, lunch, and dinner, to evaluate its impact on the human body, specifically in hospital and restaurant settings. The recognition system was used a machine vision system and deep learning algorithms to record food consumption videos, extracting images. After preprocessing, a raw dataset was bulit that consisted of 12,000 images, expanded to 66,000 images through data augmentation. Five deep learning algorithms were used for recognizing food and consumed food. ResNet50 was the best algorithm in comparison to other deep learning architectures. The effect of Hyper-parameters such as data augmentation, batch size, image size, and learning rate on performance of Resnet50 were analyzed. Transfer learning method led us to develop three versions: standard ResNet50, fine-tuned ResNet50, and optimized ResNet50 with a customized fully connected layer. ResNet50 with a specific dense layer was the best development version of ResNet50. This model with Adam optimizer, 10(-3) initial learning rate, batch size 4, and image size 340 × 640 could recognize various foods with 97.25% accuracy and 0.2 loss. Response time and training time of this architecture compared to other algorithms were confidential; the training process and response time were 5.30 h and 1.2 s. ResNet50 with a specific fully connected layer powerfully could complete tasks with high accuracy and the least time.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。