A Hybrid Sobolev Gradient Method for Learning NODEs

George Baravdish, Gabriel Eilertsen, Rym Jaroudi, B. Tomas Johansson, Lukáš Malý and Jonas Unger
Additional contact information
George Baravdish: Linköping University
Gabriel Eilertsen: Linköping University
Rym Jaroudi: Linköping University
B. Tomas Johansson: Linköping University
Lukáš Malý: Linköping University
Jonas Unger: Linköping University

SN Operations Research Forum, 2024, vol. 5, issue 4, 1-39

Abstract: The inverse problem of supervised reconstruction of depth-variable (time-dependent) parameters in ordinary differential equations is considered, with the typical application of finding the weights of a neural ordinary differential equation (NODE) for a residual network with time-continuous layers. The differential equation is treated as an abstract, isolated entity, termed a standalone NODE (sNODE), to facilitate a wide range of applications. The parameters are reconstructed by minimizing a cost functional covering a variety of loss functions and penalty terms; regularization via the penalty terms is incorporated to support ethical and trustworthy AI formulations. A nonlinear conjugate gradient (NCG) mini-batch optimization scheme is derived for the training, with the benefit of including a sensitivity problem; the model-based (differential equation) approach is thus combined with a data-driven learning procedure. Mathematical properties are stated for the differential equation and the cost functional. The adjoint problem needed for the gradient computation is derived together with the sensitivity problem, and the sensitivity problem itself can estimate changes in the output under perturbation of the trained parameters. To preserve smoothness during the iterations, the Sobolev gradient is calculated and incorporated. Numerical results validate the procedure for a NODE on synthetic datasets and compare it with standard gradient approaches. Finally, to study stability, a strategy for adversarial attacks is constructed using the sensitivity problem, and the proposed method with Sobolev gradients is shown to be more robust than standard approaches for parameter identification.
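In practice, the Sobolev gradient amounts to H^1 preconditioning of the usual L^2 gradient along the depth (time) axis: before each update, the descent direction s is obtained from the raw gradient g by solving (I - beta d^2/dt^2) s = g, which damps high-frequency oscillations in the learned depth-variable parameters. The following is a minimal NumPy sketch of one such preconditioned nonlinear CG step, not the paper's exact scheme: the names sobolev_gradient, ncg_step, and grad_fn are hypothetical, grad_fn is assumed to return the L^2 gradient of the cost (e.g., computed via the adjoint problem), and a fixed step length stands in for a proper line search.

import numpy as np

def sobolev_gradient(g, dt, beta=1.0):
    # Map the L2 gradient g (shape [n_steps, n_params]) to its H^1
    # (Sobolev) counterpart by solving (I - beta * d^2/dt^2) s = g
    # with homogeneous Neumann boundary conditions in depth/time.
    n = g.shape[0]
    L = np.zeros((n, n))                # second-difference (Laplacian) matrix
    for i in range(n):
        L[i, i] = -2.0
        if i > 0:
            L[i, i - 1] = 1.0
        if i < n - 1:
            L[i, i + 1] = 1.0
    L[0, 0] = L[-1, -1] = -1.0          # Neumann ends (one-sided differences)
    A = np.eye(n) - beta * L / dt**2    # symmetric positive definite
    return np.linalg.solve(A, g)       # solves column-wise per parameter

def ncg_step(theta, grad_fn, state, dt, lr=1e-2, beta=1.0):
    # One Polak-Ribiere nonlinear CG step with the Sobolev gradient as
    # preconditioner. `state` carries (g, s, d) from the previous step.
    g = grad_fn(theta)                  # L2 gradient, e.g. from the adjoint problem
    s = sobolev_gradient(g, dt, beta)   # smoothed (H^1) gradient
    if state is None:
        d = -s                          # first iteration: steepest descent
    else:
        g_prev, s_prev, d_prev = state
        # Preconditioned Polak-Ribiere coefficient, clipped at zero (PR+).
        num = np.sum(s * (g - g_prev))
        den = np.sum(s_prev * g_prev) + 1e-12
        coef = max(0.0, num / den)
        d = -s + coef * d_prev
    return theta + lr * d, (g, s, d)

The dense solve is for clarity only; since A is tridiagonal, scipy.linalg.solve_banded (or a Thomas solver) would be the natural choice at scale, and beta trades off smoothing strength against fidelity to the raw gradient.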

Keywords: Adversarial attacks; Deep learning; Inverse problems; Neural ordinary differential equations; Sobolev gradient
Date: 2024

Downloads: http://link.springer.com/10.1007/s43069-024-00377-x (abstract, text/html)
Access to the full text of the articles in this series is restricted.

Persistent link: https://EconPapers.repec.org/RePEc:spr:snopef:v:5:y:2024:i:4:d:10.1007_s43069-024-00377-x

Ordering information: This journal article can be ordered from
https://www.springer.com/journal/43069

DOI: 10.1007/s43069-024-00377-x

SN Operations Research Forum is currently edited by Marco Lübbecke

More articles in SN Operations Research Forum from Springer
Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.

 
Handle: RePEc:spr:snopef:v:5:y:2024:i:4:d:10.1007_s43069-024-00377-x