Utilizing Mask R-CNN for Solid-Volume Food Instance Segmentation and Calorie Estimation-Reference-Cited by-同舟云学术

Utilizing Mask R-CNN for Solid-Volume Food Instance Segmentation and Calorie Estimation

Published:2022-10-28 Issue:21 Volume:12 Page:10938
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Dai Yanyan,Park Subin^ORCID,Lee Kidong

Abstract

To prevent or deal with chronic diseases, using a smart device, automatically classifying food categories, estimating food volume and nutrients, and recording dietary intake are considered challenges. In this work, a novel real-time vision-based method for solid-volume food instance segmentation and calorie estimation is utilized, based on Mask R-CNN. In order to address the proposed method in real life, distinguishing it from other methods which use 3D LiDARs or RGB-D cameras, this work applies RGB images to train the model and uses a simple monocular camera to test the result. Gimbap is selected as an example of solid-volume food to show the utilization of the proposed method. Firstly, in order to improve detection accuracy, the proposed labeling approach for the Gimbap image datasets is introduced, based on the posture of Gimbap in plates. Secondly, an optimized model to detect Gimbap is created by fine-tuning Mask R-CNN architecture. After training, the model reaches AP (0.5 IoU) of 88.13% for Gimbap1 and AP (0.5 IoU) of 82.72% for Gimbap2. mAP (0.5 IoU) of 85.43% is achieved. Thirdly, a novel calorie estimation approach is proposed, combining the calibration result and the Gimbap instance segmentation result. In the fourth section, it is also shown how to extend the calorie estimation approach to be used in any solid-volume food, such as pizza, cake, burger, fried shrimp, oranges, and donuts. Compared with other food calorie estimation methods based on Faster R-CNN, the proposed method uses mask information and considers unseen food. Therefore, the method in this paper outperforms the accuracy of food segmentation and calorie estimation. The effectiveness of the proposed approaches is proven.

Funder

Korea Institute for Advancement of Technology

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/12/21/10938/pdf

Reference31 articles.

1. Tahir, G.A., and Loo, C.K. A Comprehensive Survey of Image-Based Food Recognition and Volume Estimation Methods for Dietary Assessment. Healthcare, 2021. 9.

2. Kaur, P., Sikka, K., Wang, W., Belongie, S.J., and Divakaran, A. Foodx-251: A dataset for fine-grained food classification. arXiv, 2019.

3. Multi-Scale Multi-View Deep Feature Aggregation for Food Recognition;Jiang;IEEE Trans. Image Process.,2019

4. Zhao, H., Yap, K.-H., and Kot, A.C. Fusion Learning using Semantics and Graph Convolutional Network for Visual Food Recognition. Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision.

5. An exploratory study on a chest-worn computer for evaluation of diet, physical activity and lifestyle;Sun;J. Healthc. Eng.,2015

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Efficient Food Image Segmentation Using YOLOv5: A Step Towards Automated Food Recognition;2024 Third International Conference on Distributed Computing and Electrical Circuits and Electronics (ICDCECE);2024-04-26

2. Multi-Spectral Food Classification and Caloric Estimation Using Predicted Images;Foods;2024-02-11

3. AI-based digital image dietary assessment methods compared to humans and ground truth: a systematic review;Annals of Medicine;2023-12-07

4. Multispectral Food Classification and Caloric Estimation Using Convolutional Neural Networks;Foods;2023-08-25