Streamlining Ocean Dynamics Modeling with Fourier Neural Operators: A Multiobjective Hyperparameter and Architecture Optimization Approach
Yixuan Sun,
Ololade Sowunmi,
Romain Egele,
Sri Hari Krishna Narayanan,
Luke Van Roekel and
Prasanna Balaprakash
Additional contact information
Yixuan Sun: Argonne National Laboratory, Lemont, IL 60439, USA
Ololade Sowunmi: Department of Mathematics, Florida State University, Tallahassee, FL 32304, USA
Romain Egele: Argonne National Laboratory, Lemont, IL 60439, USA
Sri Hari Krishna Narayanan: Argonne National Laboratory, Lemont, IL 60439, USA
Luke Van Roekel: Los Alamos National Laboratory, Los Alamos, NM 87545, USA
Prasanna Balaprakash: Oak Ridge National Laboratory, Oak Ridge, TN 37830, USA
Mathematics, 2024, vol. 12, issue 10, 1-17
Abstract:
Training an effective deep learning model of ocean processes requires careful choices of many hyperparameters. We leverage DeepHyper’s advanced search algorithms for multiobjective optimization to streamline the development of neural networks tailored for ocean modeling. The focus is on optimizing Fourier neural operators (FNOs), a class of data-driven models capable of simulating complex ocean behaviors. Selecting the right model and tuning its hyperparameters are challenging tasks that require substantial effort to ensure model accuracy. DeepHyper allows efficient exploration of hyperparameters associated with data preprocessing, the FNO architecture, and model training strategies. We aim to obtain an optimal set of hyperparameters leading to the most performant model. Moreover, in addition to the commonly used mean squared error for model training, we propose adopting the negative anomaly correlation coefficient as an additional loss term to improve model performance, and we investigate the potential trade-off between the two terms. The numerical experiments show that the optimal set of hyperparameters enhanced model performance in single-timestep forecasting and greatly exceeded the baseline configuration in autoregressive rollouts for long-horizon forecasting of up to 30 days. Using DeepHyper, we demonstrate an approach to enhance the use of FNOs in ocean dynamics forecasting, offering a scalable solution with improved precision.
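The loss described in the abstract, mean squared error plus a negative anomaly correlation coefficient (ACC) term, can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names, the `climatology` reference field, the fixed `acc_weight`, and the small stabilizing constant are all assumptions introduced here, and the abstract does not say how the two terms are weighted or how anomalies are defined.

```python
import numpy as np


def anomaly_correlation(pred, target, climatology):
    """ACC between predicted and true anomalies.

    Anomalies are taken relative to `climatology`, an assumed reference
    field with the same shape as `pred` and `target`.
    """
    pred_anom = pred - climatology
    target_anom = target - climatology
    num = np.sum(pred_anom * target_anom)
    den = np.sqrt(np.sum(pred_anom ** 2) * np.sum(target_anom ** 2))
    # The small constant guards against division by zero (an assumption).
    return num / (den + 1e-12)


def combined_loss(pred, target, climatology, acc_weight=1.0):
    """MSE plus a negative-ACC term; `acc_weight` is a hypothetical knob."""
    mse = np.mean((pred - target) ** 2)
    acc = anomaly_correlation(pred, target, climatology)
    # Minimizing -ACC pushes the correlation of anomalies toward 1.
    return mse - acc_weight * acc, mse, acc


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    target = rng.normal(size=(4, 8, 8))    # toy "ocean state" fields
    climatology = np.zeros_like(target)    # toy reference field
    pred = target + 0.1 * rng.normal(size=target.shape)
    loss, mse, acc = combined_loss(pred, target, climatology)
    print(f"loss={loss:.4f}  mse={mse:.4f}  acc={acc:.4f}")
```

Because MSE is minimized while ACC is maximized, the two terms can pull in different directions. Returning them separately alongside the combined value makes it possible to treat them as two objectives in a multiobjective search rather than fixing a single weight.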
Keywords: ocean modeling; operator learning; hyperparameter optimization
JEL-codes: C
Date: 2024
Downloads:
https://www.mdpi.com/2227-7390/12/10/1483/pdf (application/pdf)
https://www.mdpi.com/2227-7390/12/10/1483/ (text/html)
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:12:y:2024:i:10:p:1483-:d:1391804
Mathematics is currently edited by Ms. Emma He