Revisiting Named Entity Recognition in Food Computing: Enhancing Performance and Robustness

Published: 2023-11-15

Author:

Akujuobi Uchenna¹, Liu Shuhong², Besold Tarek R.¹

Affiliation:

1. Sony AI

2. University of Tokyo

Abstract

In the ever-evolving domain of food computing, Named Entity Recognition (NER) presents transformative potential that extends far beyond mere word tagging in recipes. Its implications encompass intelligent recipe recommendations, health analysis, and personalization. Nevertheless, existing NER models in food computing encounter challenges stemming from variations in recipe input standards, limited annotations, and dataset quality. This article addresses the specific problem of ingredient NER and introduces two innovative models: SINERA, an efficient and robust model, and SINERAS, a semi-supervised variant that leverages a Gaussian Mixture Model (GMM) to learn from untagged ingredient list entries. To mitigate issues associated with data quality and availability in food computing, we introduce the SINERA dataset, a diverse and comprehensive repository of ingredient lines. Additionally, we identify and tackle a pervasive challenge: spurious correlations between entity positions and predictions. To address this, we propose a set of data augmentation rules tailored for food NER. Extensive evaluations conducted on the SINERA dataset and a revised TASTEset dataset underscore the performance of our models. They outperform several state-of-the-art benchmarks and rival the BERT model while maintaining smaller parameter sizes and reduced training times.

Publisher

Research Square Platform LLC

References: 78 articles.

1. Nordström, Karin and Coff, Christian and Jönsson, Håkan and Nordenfelt, Lennart and Görman, Ulf (2013) Food and health: individual, cultural, or scientific matters? Genes & Nutrition 8(4): 357-363. BioMed Central

2. Achananuparp, Palakorn and Lim, Ee-Peng and Abhishek, Vibhanshu (2018) Does journaling encourage healthier choices? Analyzing healthy eating behaviors of food journalers. Proceedings of the 2018 International Conference on Digital Health: 35-44

3. Ludwig, David S and Willett, Walter C and Volek, Jeff S and Neuhouser, Marian L (2018) Dietary fat: from foe to friend? Science 362(6416): 764-770. American Association for the Advancement of Science

4. Menichetti, Giulia and Ravandi, Babak and Mozaffarian, Dariush and Barabási, Albert-László (2023) Machine learning prediction of the degree of food processing. Nature Communications 14(1): 2312. Nature Publishing Group UK London

5. Min, Weiqing and Jiang, Shuqiang and Liu, Linhu and Rui, Yong and Jain, Ramesh (2019) A survey on food computing. ACM Computing Surveys (CSUR) 52(5): 1-36. ACM New York, NY, USA