Linearized ADMM Converges to Second-Order Stationary Points for Non-Convex Problems

Songtao Lu, Jason Lee, Meisam Razaviyayn, Mingyi Hong

Research output: Contribution to journal › Article › peer-review

Abstract

In this work, a gradient-based primal-dual method of multipliers is proposed for solving a class of linearly constrained non-convex problems. We show that, with random initialization of the primal and dual variables, the algorithm computes second-order stationary points (SOSPs) with probability one. Further, we present applications of the proposed method to popular signal processing and machine learning problems such as decentralized matrix factorization and decentralized training of overparameterized neural networks. A key step in the analysis is to construct a new loss function for these problems such that the required convergence conditions (especially the gradient Lipschitz condition) can be satisfied without changing the global optimal points.
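For intuition, the following is a minimal numerical sketch of a generic gradient-based (linearized) primal-dual iteration for min_x f(x) subject to Ax = b: a gradient step on the augmented Lagrangian followed by a dual ascent step on the constraint residual. The toy objective, problem dimensions, step size alpha, and penalty parameter rho are illustrative assumptions, not the exact update rules or constants analyzed in the paper.

```python
# Minimal sketch (not the paper's exact algorithm or constants) of a
# gradient-based, linearized primal-dual ADMM-type iteration for
#     min_x f(x)   subject to   A x = b,
# applied to a toy non-convex objective f(x) = 0.25*||x||^4 - 0.5*||x||^2.
# The step size `alpha` and penalty `rho` are placeholders; the paper derives
# conditions on such constants for convergence to SOSPs.
import numpy as np

rng = np.random.default_rng(0)
n, m = 6, 3
A = rng.standard_normal((m, n))
b = rng.standard_normal(m)

def grad_f(x):
    # Gradient of the non-convex objective f(x) = 0.25*||x||^4 - 0.5*||x||^2.
    return (x @ x - 1.0) * x

rho, alpha = 2.0, 0.01           # penalty parameter and primal step size (placeholders)
x = rng.standard_normal(n)       # random primal initialization
lam = rng.standard_normal(m)     # random dual initialization

for _ in range(20000):
    # Linearized primal step: one gradient step on the augmented Lagrangian.
    grad_aug_lagrangian = grad_f(x) + A.T @ lam + rho * A.T @ (A @ x - b)
    x = x - alpha * grad_aug_lagrangian
    # Dual ascent step on the constraint residual.
    lam = lam + rho * (A @ x - b)

print("constraint violation ||Ax - b||:", np.linalg.norm(A @ x - b))
print("stationarity residual ||grad f + A^T lam||:", np.linalg.norm(grad_f(x) + A.T @ lam))
```

The printed residuals measure approximate first-order stationarity of the constrained problem; in practice the constants would need to be tuned to satisfy the convergence conditions established in the paper.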

Original language: English (US)
Article number: 9503322
Pages (from-to): 4859-4874
Number of pages: 16
Journal: IEEE Transactions on Signal Processing
Volume: 69
DOIs
State: Published - 2021
Externally published: Yes

Bibliographical note

Funding Information:
Manuscript received July 8, 2020; revised April 13, 2021; accepted May 25, 2021. Date of publication August 2, 2021; date of current version September 3, 2021. The associate editor coordinating the review of this manuscript and approving it for publication was Dr. V. Gripon. The work of Mingyi Hong was supported in part by the National Science Foundation under Grants CIF-1910385 and CNS-2003033, and in part by AFOSR under Grant 19RT0424. This paper was presented in part at the International Conference on Machine Learning, Stockholm, Sweden [1]. (Corresponding author: Mingyi Hong.) Songtao Lu is with the IBM Thomas J. Watson Research Center, Yorktown Heights, NY 10598 USA (e-mail: songtao@ibm.com).

Publisher Copyright:
© 1991-2012 IEEE.

Keywords

  • First-order stationary points (FOSPs)
  • alternating direction method of multipliers (ADMM)
  • neural networks
  • non-convex optimization
  • second-order stationary points (SOSPs)
