Affiliation:
1. Centro de Investigación Para el Territorio y el Hábitat Sostenible (CITEHS), Universidad Indoamérica, Quito 170301, Ecuador
2. Research Unit Sustainability and Climate Risks, Universität Hamburg, 20144 Hamburg, Germany
Abstract
Ensuring food security requires the publication of data in a timely manner, but often this information is not properly documented and evaluated. Therefore, the combination of databases from multiple sources is a common practice to curate the data and corroborate the results; however, this also results in incomplete cases. These tasks are often labor-intensive since they require a case-wise review to obtain the requested and completed information. To address these problems, an approach based on Selenium web-scraping software and the multiple imputation denoising autoencoders (MIDAS) algorithm is presented for a case study in Ecuador. The objective was to produce a multidimensional database, free of data gaps, with 72 species of food crops based on the data from 3 different open data web databases. This methodology resulted in an analysis-ready dataset with 43 parameters describing plant traits, nutritional composition, and planted areas of food crops, whose imputed data obtained an R-square of 0.84 for a control numerical parameter selected for validation. This enriched dataset was later clustered with K-means to report unprecedented insights into food crops cultivated in Ecuador. The methodology is useful for users who need to collect and curate data from different sources in a semi-automatic fashion.
Subject
Plant Science,Agronomy and Crop Science,Food Science
Reference84 articles.
1. Food Security: The Challenge of the Present;Prosekov;Geoforum,2018
2. Bridging the Food Security Gap: An Information-Led Approach to Connect Dietary Nutrition, Food Composition and Crop Production;Barkla;J. Sci. Food Agric.,2020
3. Trading-off Fish Biodiversity, Food Security, and Hydropower in the Mekong River Basin;Ziv;Proc. Natl. Acad. Sci. USA,2012
4. Little, R.J., and Rubin, D.B. (2019). Statistical Analysis with Missing Data, John Wiley & Sons.
5. Agricultural Development in Ecuador: A Compromise between Water and Food Security?;Salmoral;J. Clean. Prod.,2018
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献