Generalization Bounds for Stochastic Saddle Point Problems

Junyu Zhang, Mingyi Hong, Mengdi Wang, Shuzhong Zhang

Research output: Contribution to journal › Conference article › peer-review

14 Scopus citations

Abstract

This paper studies generalization bounds for the empirical saddle point (ESP) solution to stochastic saddle point (SSP) problems. For SSP with Lipschitz continuous and strongly convex-strongly concave objective functions, we establish an O(1/n) generalization bound by using a probabilistic stability argument. We also provide generalization bounds under a variety of assumptions, including the cases without strong convexity and without bounded domains. We illustrate our results in three examples: batch policy learning in Markov decision processes, stochastic composite optimization, and mixed strategy Nash equilibrium estimation for stochastic games. In each of these examples, we show that a regularized ESP solution enjoys a near-optimal sample complexity. To the best of our knowledge, this is the first set of results on the generalization theory of ESP.
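To make the ESP setup concrete, the following is a minimal sketch (not from the paper) of computing an empirical saddle point for a toy strongly convex-strongly concave objective via gradient descent-ascent. The quadratic objective, the coupling constant `c`, and the step size `eta` are illustrative choices; the paper itself studies generalization of the ESP solution, not a particular solver.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200
a = rng.normal(1.0, 0.5, n)   # samples entering the min player's loss
b = rng.normal(-1.0, 0.5, n)  # samples entering the max player's loss
c = 0.3                        # coupling strength (assumed, for illustration)

# Empirical objective over n samples:
#   L(x, y) = (1/n) * sum_i [ 0.5*(x - a_i)^2 - 0.5*(y - b_i)^2 + c*x*y ]
# which is strongly convex in x and strongly concave in y.

def grad_x(x, y):
    # d/dx of the empirical objective
    return (x - a.mean()) + c * y

def grad_y(x, y):
    # d/dy of the empirical objective
    return -(y - b.mean()) + c * x

# Gradient descent-ascent: descend in x, ascend in y.
x, y = 0.0, 0.0
eta = 0.1
for _ in range(2000):
    gx, gy = grad_x(x, y), grad_y(x, y)
    x -= eta * gx
    y += eta * gy

# For this quadratic game the empirical saddle point has a closed form,
# obtained by setting both gradients to zero:
x_star = (a.mean() - c * b.mean()) / (1 + c**2)
y_star = b.mean() + c * x_star
```

The iterates converge to the empirical saddle point (x_star, y_star); the paper's O(1/n) bound then controls how far this empirical solution can be from the saddle point of the population objective.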

Original language: English (US)
Pages (from-to): 568-576
Number of pages: 9
Journal: Proceedings of Machine Learning Research
Volume: 130
State: Published - 2021
Event: 24th International Conference on Artificial Intelligence and Statistics, AISTATS 2021 - Virtual, Online, United States
Duration: Apr 13 2021 to Apr 15 2021

Bibliographical note

Publisher Copyright:
Copyright © 2021 by the author(s)
