A New Multiple Imputation Approach Using Machine Learning to Enhance Climate Databases in Senegal

Author:

Toure Mory1,Klutse Nana Ama Browne2,Sarr Mamadou Adama3,Kenne Annine Duclaire4,Bhuiyanr Md Abul Ehsan5,Ndiaye Ousmane1,Badiane Daouda6,Thiaw Wassila Mamadou5,Sy Ibrahima6,Mbow Cheikh7,Sall Saïdou Moustapha6,Gaye Amadou Thierno6

Affiliation:

1. Agence Nationale de l'Aviation Civile et de la Météorologie

2. University of Ghana

3. Université Gaston Berger

4. Johannes Kepler University of Linz

5. Climate Prediction Center, National Oceanic & Atmospheric Administration (NOAA)

6. Cheikh Anta Diop University

7. Centre de Suivi Écologique

Abstract

Abstract This study aims at enhancing climate data in Senegal using information from the Global Surface Summary of the Day (GSOD). It uses data from 1991 to 2022 from major secondary synoptic stations in Senegal. These data are subject to missing values (data gaps). To address these gaps, multiple imputation was used based on three machine learning models: PMM (Predictive Mean Matching), RF (Random Forest), and NORM (Bayesian Linear Regression). The PMM model relies on averages of similar data, the RF model handles complex relationships between variables, even on an intra-seasonal scale, while the NORM model captures seasonal variations and extreme values. The results highlight the higher performance of the RF model in terms of accuracy and variance explanation compared to the others. The findings of this study open new avenues for informed decision-making in sectors such as agriculture and urban planning, where accurate climate data play a crucial role. However, while this study lays the groundwork for better utilization of climate data in Senegal, challenges persist, including the ongoing need to collect high-quality data and adapt models to data intricacies.

Publisher

Research Square Platform LLC

Reference66 articles.

1. Assessing the Performance Gap of Climate Change on Buildings Design Analytical Stages Using Future Weather Projections;Alhindawi I;Environ Clim Technol,2020

2. Multiple imputation by chained equations: What is it and how does it work?;Azur MJ;Int J Methods Psychiatr Res,2011

3. Identifying Finest Machine Learning Algorithm for Climate Data Imputation in the State of Minas Gerais, Brazil;Bayma LO;J Inform Data Manage,2018

4. Random forest in remote sensing: A review of applications and future directions;Belgiu M;ISPRS J photogrammetry remote Sens,2016

5. Impact of performance appraisal on job performance of employees in private sector universities of developing countries;Bilal H;Public Policy and Administration Research,2014

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3