A randomized nonmonotone block proximal gradient method for a class of structured nonlinear programming

Zhaosong Lu; Lin Xiao

doi:10.1137/16M1110182

A randomized nonmonotone block proximal gradient method for a class of structured nonlinear programming

Zhaosong Lu, Lin Xiao

Research output: Contribution to journal › Article › peer-review

5 Scopus citations

Abstract

We propose a randomized nonmonotone block proximal gradient (RNBPG) method for minimizing the sum of a smooth (possibly nonconvex) function and a block-separable (possibly nonconvex nonsmooth) function. At each iteration, this method randomly picks a block according to any prescribed probability distribution and solves typically several associated proximal subproblems that usually have a closed-form solution, until a certain progress on objective value is achieved. In contrast to the usual randomized block coordinate descent method [P. Richtarik and M. Takac, Math. Program., 144 (2014), pp. 1-38; A. Patrascu and I. Necoara, J. Global Optim., 61 (2015), pp. 19-46], our method has a nonmonotone avor and uses variable stepsizes that can partially utilize the local curvature information of the smooth component of objective function. We show that any accumulation point of the solution sequence of the method is a stationary point of the problem almost surely and the method is capable of finding an approximate stationary point with high probability. We also establish a sublinear rate of convergence for the method in terms of the minimal expected squared norm of certain proximal gradients over the iterations. When the problem under consideration is convex, we show that the expected objective values generated by RNBPG converge to the optimal value of the problem. Under some assumptions, we further establish a sublinear and linear rate of convergence on the expected objective values generated by a monotone version of RNBPG. Finally, we conduct some preliminary experiments to test the performance of RNBPG on the 1-regularized least-squares problem, a dual support vector machine problem in machine learning, the 0-regularized least-squares problem, and a regularized matrix completion model. The computational results demonstrate that our method substantially outperforms the randomized block coordinate descent method with fixed or variable stepsizes.

Original language	English (US)
Pages (from-to)	2930-2955
Number of pages	26
Journal	SIAM Journal on Numerical Analysis
Volume	55
Issue number	6
DOIs	https://doi.org/10.1137/16M1110182
State	Published - 2017
Externally published	Yes

Bibliographical note

Funding Information:
∗Received by the editors January 3, 2017; accepted for publication April 28, 2017; published electronically November 28, 2017. http://www.siam.org/journals/sinum/55-6/M111018.html Funding: The work of the first author was supported in part by an NSERC Discovery Grant. †Department of Mathematics, Simon Fraser University, Burnaby, BC, V5A 1S6, Canada (zhaosong@sfu.ca). ‡Machine Learning Groups, Microsoft Research, One Microsoft Way, Redmond, WA 98052 (lin.xiao@microsoft.com).

Publisher Copyright:
© 2017 Society for Industrial and Applied Mathematics.

Keywords

Block coordinate gradient method
Nonconvex composite optimization
Nonmonotone line search
Randomized algorithms

Access

10.1137/16M1110182

OpenUrl availability

Full text

Cite this

@article{b72a9f5fbb924a07a6ba8eb8d147422c,

title = "A randomized nonmonotone block proximal gradient method for a class of structured nonlinear programming",

abstract = "We propose a randomized nonmonotone block proximal gradient (RNBPG) method for minimizing the sum of a smooth (possibly nonconvex) function and a block-separable (possibly nonconvex nonsmooth) function. At each iteration, this method randomly picks a block according to any prescribed probability distribution and solves typically several associated proximal subproblems that usually have a closed-form solution, until a certain progress on objective value is achieved. In contrast to the usual randomized block coordinate descent method [P. Richtarik and M. Takac, Math. Program., 144 (2014), pp. 1-38; A. Patrascu and I. Necoara, J. Global Optim., 61 (2015), pp. 19-46], our method has a nonmonotone avor and uses variable stepsizes that can partially utilize the local curvature information of the smooth component of objective function. We show that any accumulation point of the solution sequence of the method is a stationary point of the problem almost surely and the method is capable of finding an approximate stationary point with high probability. We also establish a sublinear rate of convergence for the method in terms of the minimal expected squared norm of certain proximal gradients over the iterations. When the problem under consideration is convex, we show that the expected objective values generated by RNBPG converge to the optimal value of the problem. Under some assumptions, we further establish a sublinear and linear rate of convergence on the expected objective values generated by a monotone version of RNBPG. Finally, we conduct some preliminary experiments to test the performance of RNBPG on the 1-regularized least-squares problem, a dual support vector machine problem in machine learning, the 0-regularized least-squares problem, and a regularized matrix completion model. The computational results demonstrate that our method substantially outperforms the randomized block coordinate descent method with fixed or variable stepsizes.",

keywords = "Block coordinate gradient method, Nonconvex composite optimization, Nonmonotone line search, Randomized algorithms",

author = "Zhaosong Lu and Lin Xiao",

note = "Funding Information: ∗Received by the editors January 3, 2017; accepted for publication April 28, 2017; published electronically November 28, 2017. http://www.siam.org/journals/sinum/55-6/M111018.html Funding: The work of the first author was supported in part by an NSERC Discovery Grant. †Department of Mathematics, Simon Fraser University, Burnaby, BC, V5A 1S6, Canada (zhaosong@sfu.ca). ‡Machine Learning Groups, Microsoft Research, One Microsoft Way, Redmond, WA 98052 (lin.xiao@microsoft.com). Publisher Copyright: {\textcopyright} 2017 Society for Industrial and Applied Mathematics.",

year = "2017",

doi = "10.1137/16M1110182",

language = "English (US)",

volume = "55",

pages = "2930--2955",

journal = "SIAM Journal on Numerical Analysis",

issn = "0036-1429",

publisher = "Society for Industrial and Applied Mathematics Publications",

number = "6",

}

TY - JOUR

T1 - A randomized nonmonotone block proximal gradient method for a class of structured nonlinear programming

AU - Lu, Zhaosong

AU - Xiao, Lin

N1 - Funding Information: ∗Received by the editors January 3, 2017; accepted for publication April 28, 2017; published electronically November 28, 2017. http://www.siam.org/journals/sinum/55-6/M111018.html Funding: The work of the first author was supported in part by an NSERC Discovery Grant. †Department of Mathematics, Simon Fraser University, Burnaby, BC, V5A 1S6, Canada (zhaosong@sfu.ca). ‡Machine Learning Groups, Microsoft Research, One Microsoft Way, Redmond, WA 98052 (lin.xiao@microsoft.com). Publisher Copyright: © 2017 Society for Industrial and Applied Mathematics.

PY - 2017

Y1 - 2017

N2 - We propose a randomized nonmonotone block proximal gradient (RNBPG) method for minimizing the sum of a smooth (possibly nonconvex) function and a block-separable (possibly nonconvex nonsmooth) function. At each iteration, this method randomly picks a block according to any prescribed probability distribution and solves typically several associated proximal subproblems that usually have a closed-form solution, until a certain progress on objective value is achieved. In contrast to the usual randomized block coordinate descent method [P. Richtarik and M. Takac, Math. Program., 144 (2014), pp. 1-38; A. Patrascu and I. Necoara, J. Global Optim., 61 (2015), pp. 19-46], our method has a nonmonotone avor and uses variable stepsizes that can partially utilize the local curvature information of the smooth component of objective function. We show that any accumulation point of the solution sequence of the method is a stationary point of the problem almost surely and the method is capable of finding an approximate stationary point with high probability. We also establish a sublinear rate of convergence for the method in terms of the minimal expected squared norm of certain proximal gradients over the iterations. When the problem under consideration is convex, we show that the expected objective values generated by RNBPG converge to the optimal value of the problem. Under some assumptions, we further establish a sublinear and linear rate of convergence on the expected objective values generated by a monotone version of RNBPG. Finally, we conduct some preliminary experiments to test the performance of RNBPG on the 1-regularized least-squares problem, a dual support vector machine problem in machine learning, the 0-regularized least-squares problem, and a regularized matrix completion model. The computational results demonstrate that our method substantially outperforms the randomized block coordinate descent method with fixed or variable stepsizes.

AB - We propose a randomized nonmonotone block proximal gradient (RNBPG) method for minimizing the sum of a smooth (possibly nonconvex) function and a block-separable (possibly nonconvex nonsmooth) function. At each iteration, this method randomly picks a block according to any prescribed probability distribution and solves typically several associated proximal subproblems that usually have a closed-form solution, until a certain progress on objective value is achieved. In contrast to the usual randomized block coordinate descent method [P. Richtarik and M. Takac, Math. Program., 144 (2014), pp. 1-38; A. Patrascu and I. Necoara, J. Global Optim., 61 (2015), pp. 19-46], our method has a nonmonotone avor and uses variable stepsizes that can partially utilize the local curvature information of the smooth component of objective function. We show that any accumulation point of the solution sequence of the method is a stationary point of the problem almost surely and the method is capable of finding an approximate stationary point with high probability. We also establish a sublinear rate of convergence for the method in terms of the minimal expected squared norm of certain proximal gradients over the iterations. When the problem under consideration is convex, we show that the expected objective values generated by RNBPG converge to the optimal value of the problem. Under some assumptions, we further establish a sublinear and linear rate of convergence on the expected objective values generated by a monotone version of RNBPG. Finally, we conduct some preliminary experiments to test the performance of RNBPG on the 1-regularized least-squares problem, a dual support vector machine problem in machine learning, the 0-regularized least-squares problem, and a regularized matrix completion model. The computational results demonstrate that our method substantially outperforms the randomized block coordinate descent method with fixed or variable stepsizes.

KW - Block coordinate gradient method

KW - Nonconvex composite optimization

KW - Nonmonotone line search

KW - Randomized algorithms

UR - http://www.scopus.com/inward/record.url?scp=85039949923&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85039949923&partnerID=8YFLogxK

U2 - 10.1137/16M1110182

DO - 10.1137/16M1110182

M3 - Article

AN - SCOPUS:85039949923

SN - 0036-1429

VL - 55

SP - 2930

EP - 2955

JO - SIAM Journal on Numerical Analysis

JF - SIAM Journal on Numerical Analysis

IS - 6

ER -

A randomized nonmonotone block proximal gradient method for a class of structured nonlinear programming

Abstract

Bibliographical note

Keywords

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this