Few-shot Food Recognition via Multi-view Representation Learning

Published:2020-08-31 Issue:3 Volume:16 Page:1-20
ISSN:1551-6857
Container-title:ACM Transactions on Multimedia Computing, Communications, and Applications
language:en
Short-container-title:ACM Trans. Multimedia Comput. Commun. Appl.

Author:

Jiang Shuqiang¹, Min Weiqing¹, Lyu Yongqiang², Liu Linhu¹

Affiliation:

1. Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China

2. Qingdao KingAgroot Precision Agriculture Technology Co., Ltd, Qingdao, Shandong, China

Abstract

This article considers the problem of few-shot learning for food recognition. Automatic food recognition can support various applications, e.g., dietary assessment and food journaling. Most existing works focus on food recognition with large numbers of labelled samples, and fail to recognize food categories with few samples. To address this problem, we propose a Multi-View Few-Shot Learning (MVFSL) framework to explore additional ingredient information for few-shot food recognition. Besides category-oriented deep visual features, we introduce an ingredient-supervised deep network to extract ingredient-oriented features. As general and intermediate attributes of food, ingredient-oriented features are informative and complementary to category-oriented features, and thus they play an important role in improving food recognition. Particularly in few-shot food recognition, ingredient information can bridge the gap between disjoint training categories and test categories. To take advantage of ingredient information, we fuse these two kinds of features by first combining their feature maps from their respective deep networks and then convolving the combined feature maps. Such convolution is further incorporated into a multi-view relation network, which is capable of comparing pairwise images to enable fine-grained feature learning. MVFSL is trained in an end-to-end fashion for joint optimization on two types of feature learning subnetworks and relation subnetworks. Extensive experiments on different food datasets have consistently demonstrated the advantage of MVFSL in multi-view feature fusion. Furthermore, we extend two other types of networks, namely, Siamese Network and Matching Network, by introducing ingredient information for few-shot food recognition. Experimental results have also shown that introducing ingredient information into these two networks can improve the performance of few-shot food recognition.
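
The fusion step the abstract describes (combine the two kinds of feature maps, convolve the combined maps, then compare a pair of images) can be illustrated with a rough NumPy sketch. This is not the authors' implementation. The function names `fuse_views` and `relation_score`, the channel-wise concatenation, the 1x1 convolution, the random weights, and the pooled sigmoid scorer are all assumptions chosen to keep the example short.

```python
import numpy as np

def fuse_views(cat_map, ing_map, weight):
    """Concatenate two (C, H, W) feature maps on the channel axis, then apply a 1x1 convolution.

    weight has shape (C_out, C_cat + C_ing). A 1x1 convolution is a per-pixel
    linear map over channels, so a single einsum expresses it.
    """
    combined = np.concatenate([cat_map, ing_map], axis=0)   # (C_cat + C_ing, H, W)
    fused = np.einsum("oc,chw->ohw", weight, combined)      # (C_out, H, W)
    return np.maximum(fused, 0.0)                           # ReLU non-linearity

def relation_score(fused_a, fused_b, rel_weight):
    """Hypothetical relation head: average-pool each fused map, concatenate the pair,
    and squash a linear score into (0, 1) with a sigmoid."""
    pair = np.concatenate([fused_a.mean(axis=(1, 2)), fused_b.mean(axis=(1, 2))])
    return float(1.0 / (1.0 + np.exp(-(rel_weight @ pair))))

rng = np.random.default_rng(0)
c_cat, c_ing, c_out, height, width = 8, 6, 4, 5, 5

# Random stand-ins for learned parameters (assumption: nothing is trained here).
conv_w = rng.normal(size=(c_out, c_cat + c_ing))
rel_w = rng.normal(size=2 * c_out)

# Random stand-ins for the outputs of the category and ingredient subnetworks.
support = fuse_views(rng.normal(size=(c_cat, height, width)),
                     rng.normal(size=(c_ing, height, width)), conv_w)
query = fuse_views(rng.normal(size=(c_cat, height, width)),
                   rng.normal(size=(c_ing, height, width)), conv_w)

score = relation_score(support, query, rel_w)
print(support.shape, score)
```

In a trained model the convolution and relation weights would be learned jointly with the two feature subnetworks, as the abstract states. Here they are random, so the score carries no meaning beyond showing the data flow.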

Funder

National Program for Special Support of Eminent Professionals

National Natural Science Foundation of China

National Program for Support of Top-notch Young Professionals

Beijing Natural Science Foundation

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications, Hardware and Architecture

Link

https://dl.acm.org/doi/pdf/10.1145/3391624

References: 66 articles.

1. Food Balance Estimation by Using Personal Dietary Tendencies in a Multimedia Food Log

2. Social Media Image Recognition for Food Trend Analysis

3. Menu-Match: Restaurant-Specific Food Logging from Images

Cited by 29 articles.

1. MiniTomatoNet: a lightweight CNN for tomato leaf disease recognition on heterogeneous FPGA-SoC;The Journal of Supercomputing;2024-06-17

2. Food Computing for Nutrition and Health;2024 IEEE 40th International Conference on Data Engineering Workshops (ICDEW);2024-05-13

3. Multi-Content Interaction Network for Few-Shot Segmentation;ACM Transactions on Multimedia Computing, Communications, and Applications;2024-03-08

4. Few shot learning for avocado maturity determination from microwave images;Journal of Agriculture and Food Research;2024-03

5. Viewpoint Disentangling and Generation for Unsupervised Object Re-ID;ACM Transactions on Multimedia Computing, Communications, and Applications;2024-01-22