Streamlining Ocean Dynamics Modeling with Fourier Neural Operators: A Multiobjective Hyperparameter and Architecture Optimization Approach
-
Published:2024-05-10
Issue:10
Volume:12
Page:1483
-
ISSN:2227-7390
-
Container-title:Mathematics
-
language:en
-
Short-container-title:Mathematics
Author:
Sun Yixuan1ORCID, Sowunmi Ololade2, Egele Romain13ORCID, Narayanan Sri Hari Krishna1ORCID, Van Roekel Luke4ORCID, Balaprakash Prasanna5ORCID
Affiliation:
1. Argonne National Laboratory, Lemont, IL 60439, USA 2. Department of Mathematics, Florida State University, Tallahassee, FL 32304, USA 3. Laboratoire Interdisciplinaire des Sciences du Numérique, Université Paris-Saclay, 91190 Gif-sur-Yvette, France 4. Los Alamos National Laboratory, Los Alamos, NM 87545, USA 5. Oak Ridge National Laboratory, Oak Ridge, TN 37830, USA
Abstract
Training an effective deep learning model to learn ocean processes involves careful choices of various hyperparameters. We leverage DeepHyper’s advanced search algorithms for multiobjective optimization, streamlining the development of neural networks tailored for ocean modeling. The focus is on optimizing Fourier neural operators (FNOs), a data-driven model capable of simulating complex ocean behaviors. Selecting the correct model and tuning the hyperparameters are challenging tasks, requiring much effort to ensure model accuracy. DeepHyper allows efficient exploration of hyperparameters associated with data preprocessing, FNO architecture-related hyperparameters, and various model training strategies. We aim to obtain an optimal set of hyperparameters leading to the most performant model. Moreover, on top of the commonly used mean squared error for model training, we propose adopting the negative anomaly correlation coefficient as the additional loss term to improve model performance and investigate the potential trade-off between the two terms. The numerical experiments show that the optimal set of hyperparameters enhanced model performance in single timestepping forecasting and greatly exceeded the baseline configuration in the autoregressive rollout for long-horizon forecasting up to 30 days. Utilizing DeepHyper, we demonstrate an approach to enhance the use of FNO in ocean dynamics forecasting, offering a scalable solution with improved precision.
Funder
Argonne Leadership Computing Facility at Argonne National Laboratory U.S. Department of Energy
Reference51 articles.
1. Kurth, T., Treichler, S., Romero, J., Mudigonda, M., Luehr, N., Phillips, E., Mahesh, A., Matheson, M., Deslippe, J., and Fatica, M. (2018, January 11–16). Exascale deep learning for climate analytics. Proceedings of the SC18: International Conference for High Performance Computing, Computing, Networking, Storage and Analysis, Dallas, TX, USA. 2. Deep learning to represent subgrid processes in climate models;Rasp;Proc. Natl. Acad. Sci. USA,2018 3. Nguyen, T., Brandstetter, J., Kapoor, A., Gupta, J.K., and Grover, A. (2023). ClimaX: A foundation model for weather and climate. arXiv. 4. Training machine learning models on climate model output yields skillful interpretable seasonal precipitation forecasts;Gibson;Commun. Earth Environ.,2021 5. Pathak, J., Subramanian, S., Harrington, P., Raja, S., Chattopadhyay, A., Mardani, M., Kurth, T., Hall, D., Li, Z., and Azizzadenesheli, K. (2022). FourCastNet: A global data-driven high-resolution weather model using adaptive Fourier neural operators. arXiv.
|
|