A Stochastic Subgradient Method for Distributionally Robust Non-convex and Non-smooth Learning

Gürbüzbalaban, Mert; Ruszczyński, Andrzej; Zhu, Landi

Additional contact information
Mert Gürbüzbalaban: Rutgers University
Andrzej Ruszczyński: Rutgers University
Landi Zhu: Rutgers University

Journal of Optimization Theory and Applications, 2022, vol. 194, issue 3, No 11, 1014-1041

Abstract: We consider a distributionally robust formulation of stochastic optimization problems arising in statistical learning, where robustness is with respect to ambiguity in the underlying data distribution. Our formulation builds on risk-averse optimization techniques and the theory of coherent risk measures. It uses mean–semideviation risk for quantifying uncertainty, allowing us to compute solutions that are robust against perturbations in the population data distribution. We consider a broad class of generalized differentiable loss functions that can be non-convex and non-smooth, involving upward and downward cusps, and we develop an efficient stochastic subgradient method for distributionally robust problems with such functions. We prove that it converges to a point satisfying the optimality conditions. To our knowledge, this is the first method with rigorous convergence guarantees in the context of generalized differentiable non-convex and non-smooth distributionally robust stochastic optimization. Our method allows for the control of the desired level of robustness with little extra computational cost compared to population risk minimization with stochastic gradient methods. We also illustrate the performance of our algorithm on real datasets arising in convex and non-convex supervised learning problems.
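
Illustration (not from the paper): as a rough sketch of the kind of risk measure the abstract names, the snippet below computes an empirical first-order mean-upper-semideviation, E[Z] + kappa * E[max(0, Z - E[Z])], over a sample of losses. The function name, the sample values, and the choice of first order with kappa in [0, 1] are assumptions made for illustration; the paper's exact formulation and its subgradient method are not reproduced here.

```python
from statistics import fmean


def mean_semideviation(losses, kappa=0.5):
    """Empirical first-order mean-upper-semideviation of a loss sample.

    Hypothetical helper for illustration only. kappa in [0, 1] sets the
    level of risk aversion; kappa = 0 recovers the plain sample mean.
    """
    if not 0.0 <= kappa <= 1.0:
        raise ValueError("kappa must lie in [0, 1]")
    losses = list(losses)
    mean = fmean(losses)
    # Penalize only losses that exceed the mean (the upper semideviation).
    upper_semidev = fmean([max(0.0, z - mean) for z in losses])
    return mean + kappa * upper_semidev


sample = [1.0, 2.0, 3.0, 6.0]  # made-up losses
print(mean_semideviation(sample, kappa=0.0))  # plain average
print(mean_semideviation(sample, kappa=0.5))  # average plus half the upper semideviation
```

Larger values of kappa weight above-average losses more heavily, which is one way to read the abstract's remark about controlling the desired level of robustness.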

Keywords: Robust learning; Risk measures; Stochastic subgradient method; Non-smooth optimization; Composition optimization; 90C15; 90C48
Date: 2022

Downloads: (external link)
http://link.springer.com/10.1007/s10957-022-02063-6 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Persistent link: https://EconPapers.repec.org/RePEc:spr:joptap:v:194:y:2022:i:3:d:10.1007_s10957-022-02063-6

Ordering information: This journal article can be ordered from
http://www.springer. ... cs/journal/10957/PS2

DOI: 10.1007/s10957-022-02063-6

Journal of Optimization Theory and Applications is currently edited by Franco Giannessi and David G. Hull

Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.