Author:
Hooton Forrest,Menichetti Giulia,Barabási Albert-László
Abstract
AbstractThanks to the many chemical and nutritional components it carries, diet critically affects human health. However, the currently available comprehensive databases on food composition cover only a tiny fraction of the total number of chemicals present in our food, focusing on the nutritional components essential for our health. Indeed, thousands of other molecules, many of which have well documented health implications, remain untracked. To explore the body of knowledge available on food composition, we built FoodMine, an algorithm that uses natural language processing to identify papers from PubMed that potentially report on the chemical composition of garlic and cocoa. After extracting from each paper information on the reported quantities of chemicals, we find that the scientific literature carries extensive information on the detailed chemical components of food that is currently not integrated in databases. Finally, we use unsupervised machine learning to create chemical embeddings, finding that the chemicals identified by FoodMine tend to have direct health relevance, reflecting the scientific community’s focus on health-related chemicals in our food.
Funder
American Heart Association
National Institutes of Health
European Research Council
Publisher
Springer Science and Business Media LLC
Reference39 articles.
1. USDA. National Nutrient Database for Standard Reference, Release 28 (2015) Documentation and User Guide. 28, (2015).
2. Bhagwat, S., Haytowitz, D. B. & Holden, J. M. USDA database for the flavonoid content of selected foods release 3. U.S. Dep. Agric. 1–156. https://www.ars.usda.gov/ARSUserFiles/80400525/Data/Flav/Flav_R03-1.pdf (2011).
3. FooDB. https://foodb.ca/. Accessed 25 June 2019.
4. National Food Institute. Frida Food Data, version 1. Technical University of Denmark (2015). https://frida.fooddata.dk.
5. U.S. Department of Agriculture, A. R. S. Dr. Duke’s Phytochemical and Ethnobotanical Databases. (1992). https://doi.org/10.15482/USDA.ADC/1239279.
Cited by
22 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献