Convergence Rate of Inertial Proximal Algorithms with General Extrapolation and Proximal Coefficients

Attouch, Hedy; Chbani, Zaki; Riahi, Hassan

doi:10.1007/s10013-020-00399-y

Original Article
Published: 29 April 2020

Vietnam Journal of Mathematics, Volume 48, pages 247–276 (2020)

Abstract

In a Hilbert space setting ${\mathcal{H}}$, in order to minimize by fast methods a general convex lower semicontinuous and proper function ${\Phi }: {\mathcal{H}} \rightarrow \mathbb {R} \cup \{+\infty \}$, we analyze the convergence rate of the inertial proximal algorithms. These algorithms involve both extrapolation coefficients (including Nesterov acceleration method) and proximal coefficients in a general form. They can be interpreted as the discrete time version of inertial continuous gradient systems with general damping and time scale coefficients. Based on the proper setting of these parameters, we show the fast convergence of values and the convergence of iterates. In doing so, we provide an overview of this class of algorithms. Our study complements the previous Attouch–Cabot paper (SIOPT, 2018) by introducing into the algorithm time scaling aspects, and sheds new light on the Güler seminal papers on the convergence rate of the accelerated proximal methods for convex optimization.
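As a rough illustration of the kind of scheme the abstract describes, the following Python sketch runs an inertial proximal iteration of the form $y_k = x_k + \alpha_k (x_k - x_{k-1})$, $x_{k+1} = \operatorname{prox}_{\beta_k \Phi}(y_k)$. This update form is an assumption on our part, since the abstract does not spell out the algorithm. The objective $\Phi = \|\cdot\|_1$, the coefficients $\alpha_k = (k-1)/(k+2)$ and $\beta_k = 1$, and all function names are hypothetical choices made only for the example.

```python
import numpy as np

def prox_l1(y, beta):
    """Proximal map of beta * ||.||_1 (soft-thresholding), a standard closed form."""
    return np.sign(y) * np.maximum(np.abs(y) - beta, 0.0)

def inertial_proximal(x0, prox, alpha, beta, n_iter):
    """Assumed scheme: extrapolate with alpha(k), then take a proximal step with beta(k)."""
    x_prev = np.asarray(x0, dtype=float)
    x = x_prev.copy()
    for k in range(1, n_iter + 1):
        y = x + alpha(k) * (x - x_prev)      # extrapolation (inertial) step
        x_prev, x = x, prox(y, beta(k))      # proximal step with coefficient beta_k
    return x

# Hypothetical parameter choices: Nesterov-type extrapolation, constant proximal coefficient.
x_final = inertial_proximal(
    x0=[5.0, -3.0],
    prox=prox_l1,
    alpha=lambda k: (k - 1) / (k + 2),
    beta=lambda k: 1.0,
    n_iter=50,
)
print("final iterate:", x_final, " Phi value:", np.sum(np.abs(x_final)))
```

Passing `alpha` and `beta` as callables keeps the sketch close to the general coefficient setting of the paper: other extrapolation or proximal sequences can be tried without changing the loop.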

References

Alvarez, F., Attouch, H.: An inertial proximal method for maximal monotone operators via discretization of a nonlinear oscillator with damping. Set-Valued Anal. 9, 3–11 (2001)
Álvarez, F., Attouch, H., Bolte, J., Redont, P.: A second-order gradient-like dissipative dynamical system with Hessian-driven damping. Application to optimization and mechanics. J. Math. Pures Appl. 81, 747–779 (2002)
Apidopoulos, V., Aujol, J.-F., Dossal, Ch.: Convergence rate of inertial forward-backward algorithm beyond Nesterov’s rule. Math. Program. https://doi.org/10.1007/s10107-018-1350-9. HAL-01551873 (2018)
Attouch, H., Cabot, A.: Asymptotic stabilization of inertial gradient dynamics with time-dependent viscosity. J. Differ. Equ. 263, 5412–5458 (2017)
Attouch, H., Cabot, A.: Convergence rates of inertial forward-backward algorithms. SIAM J. Optim. 28, 849–874 (2018)
Attouch, H., Cabot, A., Chbani, Z., Riahi, H.: Inertial forward-backward algorithms with perturbations: application to Tikhonov regularization. J. Optim. Theory Appl. 179, 1–36 (2018)
Attouch, H., Chbani, Z., Riahi, H.: Fast proximal methods via time scaling of damped inertial dynamics. SIAM J. Optim. 29, 2227–2256 (2019)
Attouch, H., Chbani, Z., Peypouquet, J., Redont, P.: Fast convergence of inertial dynamics and algorithms with asymptotic vanishing viscosity. Math. Program. Ser. B 168, 123–175 (2018)
Attouch, H., Chbani, Z., Riahi, H.: Rate of convergence of the Nesterov accelerated gradient method in the subcritical case α ≤ 3. ESAIM-COCV, 25 (2019). https://doi.org/10.1051/cocv/2017083
Attouch, H., Peypouquet, J.: The rate of convergence of Nesterov’s accelerated forward-backward method is actually faster than 1/k². SIAM J. Optim. 26, 1824–1834 (2016)
Aujol, J.-F., Dossal, Ch.: Stability of over-relaxations for the forward-backward algorithm, application to FISTA. SIAM J. Optim. 25, 2408–2433 (2015)
Aujol, J.-F., Dossal, Ch.: Optimal rate of convergence of an ODE associated to the Fast Gradient Descent schemes for b > 0. https://hal.inria.fr/hal-01547251v2 (2017)
Bauschke, H.H., Combettes, P.L.: Convex Analysis and Monotone Operator Theory in Hilbert Spaces. CMS Books in Mathematics. Springer, Cham (2011)
Beck, A., Teboulle, M.: A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM J. Imaging Sci. 2, 183–202 (2009)
Boţ, R. I., Csetnek, E.R., László, S. C.: A second-order dynamical approach with variable damping to nonconvex smooth minimization. Appl. Anal. (2018). https://doi.org/10.1080/00036811.2018.1495330
Bonettini, S., Porta, F., Ruggiero, V.: A variable metric forward-backward method with extrapolation. SIAM J. Sci. Comput. 38, A2558–A2584 (2016)
Burger, M., Sawatzky, A., Steidl, G.: First order algorithms in variational image processing. In: Glowinski, R., Osher, S., Yin, W (eds.) Splitting Methods in Communication, Imaging, Science, and Engineering, pp 345–407. Springer, Cham (2016)
Calatroni, L., Chambolle, A.: Backtracking strategies for accelerated descent methods with smooth composite objectives. SIAM J. Optim. 29, 1772–1798 (2019)
Chambolle, A., Dossal, Ch.: On the convergence of the iterates of the “Fast Iterative Shrinkage/Thresholding Algorithm”. J. Optim. Theory Appl. 166, 968–982 (2015)
Combettes, P.L., Glaudin, L.E.: Proximal activation of smooth functions in splitting algorithms for convex image recovery. SIAM J. Imaging Sci. (2019). To appear
Combettes, P.L., Wajs, V.R.: Signal recovery by proximal forward-backward splitting. Multiscale Model Simul. 4, 1168–1200 (2005)
Güler, O.: On the convergence of the proximal point algorithm for convex optimization. SIAM J. Control Optim. 29, 403–419 (1991)
Güler, O.: New proximal point algorithms for convex minimization. SIAM J. Optim. 2, 649–664 (1992)
Kim, D., Fessler, J.A.: Optimized first-order methods for smooth convex minimization. Math. Program. 159, 81–107 (2016)
Liang, J., Fadili, J., Peyré, G.: Local linear convergence of forward-backward under partial smoothness. In: Ghahramani, Z., et al. (eds.) Advances in Neural Information Processing Systems 27, pp 1970–1978. Curran Associates Inc. (2014)
Lorenz, D.A., Pock, Th.: An inertial forward-backward algorithm for monotone inclusions. J. Math. Imaging Vis. 51, 311–325 (2015)
May, R.: Asymptotic for a second-order evolution equation with convex potential and vanishing damping term. Turk. J. Math. 41, 681–685 (2017)
Nesterov, Y.: A method of solving a convex programming problem with convergence rate O(1/k²). Soviet. Math. Dokl. 27, 372–376 (1983)
Nesterov, Y.: Introductory Lectures on Convex Optimization: A Basic Course. Applied Optimization, vol. 87. Kluwer Academic Publishers, Boston (2004)
Opial, Z.: Weak convergence of the sequence of successive approximations for nonexpansive mappings. Bull. Am. Math. Soc. 73, 591–597 (1967)
Parikh, N., Boyd, S.: Proximal algorithms. Foundations and Trends in optimization, vol. 1, pp. 127–239 (2013)
Peypouquet, J.: Convex Optimization in Normed Spaces: Theory, Methods and Examples. Springer, Cham (2015)
Peypouquet, J., Sorin, S.: Evolution equations for maximal monotone operators: asymptotic analysis in continuous and discrete time. J. Convex Anal. 17, 1113–1163 (2010)
Polyak, B.T.: Some methods of speeding up the convergence of iteration methods. USSR Comput Math. Math. Phys. 4, 1–17 (1964)
Polyak, B.T.: Introduction to Optimization. New York: Optimization Software (1987)
Scheinberg, K., Goldfarb, D., Bai, X.: Fast first-order methods for composite convex optimization with backtracking. Found. Comput. Math. 14, 389–417 (2014)
Shi, B., Du, S.S., Jordan, M.I., Su, W.J.: Understanding the acceleration phenomenon via high-resolution differential equations. arXiv:1810.08907 (2018)
Schmidt, M., Le Roux, N., Bach, F.: Convergence rates of inexact proximal-gradient methods for convex optimization. NIPS’11 - 25th Annual Conference on Neural Information Processing Systems, Dec 2011, Grenada, Spain. HAL-inria-00618152v3 (2011)
Su, W.J., Boyd, S., Candès, E.J.: A differential equation for modeling Nesterov’s accelerated gradient method: theory and insights. In: Ghahramani, Z., et al. (eds.) Advances in Neural Information Processing Systems 27, pp 2510–2518. Curran Associates Inc. (2014)
Villa, S., Salzo, S., Baldassarre, L., Verri, A.: Accelerated and inexact forward-backward algorithms. SIAM J. Optim. 23, 1607–1633 (2013)

Author information

Authors and Affiliations

IMAG, CNRS, Université de Montpellier, Montpellier, France
Hedy Attouch
Faculty of Sciences Semlalia, Mathematics, Cadi Ayyad University, 40000, Marrakech, Morocco
Zaki Chbani & Hassan Riahi

Corresponding author

Correspondence to Hedy Attouch.

Additional information

This paper is dedicated to Professor Marco A. López Cerdá on the occasion of his 70th birthday.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix: Some Auxiliary Results

The following lemmas are used throughout the paper. To establish the weak convergence of the iterates of (IP)$_{\alpha _{k}, \beta _{k}}$, we apply Opial’s Lemma [30], which we recall in its discrete form.

Lemma 4

Let $S$ be a nonempty subset of ${\mathcal{H}}$, and $(x_k)$ a sequence in ${\mathcal{H}}$. Assume that

(i) every sequential weak cluster point of $(x_k)$, as $k\to +\infty $, belongs to $S$;
(ii) for every $z \in S$, $\lim _{k\to +\infty }\|x_{k}-z\|$ exists.

Then $(x_k)$ converges weakly as $k \to +\infty $ to a point in $S$.

The next lemma allows us to estimate the rate of convergence of a sequence $(\varepsilon_k)$ that is non-increasing and summable with respect to weight coefficients; see [5, Lemma 21] for the proof.

Lemma 5

Let $(\tau_k)$ be a nonnegative sequence such that ${\sum }_{k=1}^{+\infty } \tau _{k}=+\infty $. Assume that $(\varepsilon_k)$ is a nonnegative and non-increasing sequence satisfying ${\sum }_{k=1}^{+\infty } \tau _{k} \varepsilon _{k}<+\infty $. Then we have

$$ \varepsilon_{k} = o\left( \frac{1}{{\sum}_{i=1}^{k} \tau_{i}}\right) \quad \text{ as }~ k\to +\infty. $$
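A small numerical illustration of Lemma 5 may help fix ideas. The sketch below uses the hypothetical choices $\tau_k = k$ and $\varepsilon_k = k^{-5/2}$ (not taken from the paper), for which $\sum \tau_k = +\infty$ and $\sum \tau_k \varepsilon_k = \sum k^{-3/2} < +\infty$, and prints the product $\varepsilon_k \sum_{i=1}^{k} \tau_i$, which the lemma predicts tends to zero.

```python
import numpy as np

# Hypothetical choices (not from the paper): weights tau_k = k and
# eps_k = k**(-2.5), so that sum(tau_k) diverges while
# sum(tau_k * eps_k) = sum(k**(-1.5)) converges.
k = np.arange(1, 10**6 + 1, dtype=float)
tau = k
eps = k ** -2.5

partial_tau = np.cumsum(tau)     # sum_{i<=k} tau_i = k (k + 1) / 2
scaled = eps * partial_tau       # Lemma 5 predicts this tends to 0

checkpoints = [10**p for p in range(1, 7)]
scaled_at_checkpoints = [scaled[n - 1] for n in checkpoints]
for n, value in zip(checkpoints, scaled_at_checkpoints):
    print(f"k = {n:>7d}   eps_k * sum(tau_i) = {value:.3e}")
```

For this particular pair the product equals $\tfrac{1}{2}(k^{-1/2} + k^{-3/2})$, so the printed values decrease roughly like $k^{-1/2}$; a finite computation of this kind only illustrates the statement and proves nothing.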

The following result shows the summability of a sequence $(a_k)$ satisfying a suitable inequality.

Lemma 6

Given a nonnegative sequence $(\alpha_k)$ satisfying (K$_0$), let $(t_k)$ be the sequence defined by $t_{k}=1+{\sum }_{i=k}^{+\infty }{\prod }_{j=k}^{i}\alpha _{j}$. Let $(a_k)$ and $(\omega_k)$ be two nonnegative sequences such that

$$ a_{k+1} \leq \alpha_{k}a_{k}+\omega_{k} \tag{51} $$

for all $k \geq 0$. If ${\sum }_{k=0}^{+\infty }t_{k+1}\omega _{k}<+\infty $, then ${\sum }_{k=0}^{+\infty }a_{k}<+\infty $.

Proof

By Lemma 1, we have $t_{k+1}\alpha_{k} = t_{k} - 1$. Multiplying inequality (51) by $t_{k+1}$ gives

$$ t_{k+1}a_{k+1}\leq (t_{k}-1)a_{k}+t_{k+1}\omega_{k}, $$

or equivalently $a_{k} \leq (t_{k}a_{k} - t_{k+1}a_{k+1}) + t_{k+1}\omega_{k}$. By summing from $k = 0$ to $n$, we obtain

$$ \sum\limits_{k=0}^{n}a_{k} \leq t_{0} a_{0} - t_{n+1}a_{n+1} + \sum\limits_{k=0}^{n}t_{k+1}\omega_{k} \leq t_{0}a_{0} + \sum\limits_{k=0}^{+\infty}t_{k+1}\omega_{k} < +\infty. $$

The conclusion follows by letting n tend to $+\infty $. □
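The two ingredients of this proof, the identity $t_{k+1}\alpha_k = t_k - 1$ and the telescoped bound on $\sum a_k$, can be checked numerically on a concrete instance. The sketch below assumes the hypothetical choice $\alpha_k = k/(k+3)$, for which a telescoping computation gives $t_k = (k+2)/2$, together with $\omega_k = (k+1)^{-3}$ and $a_0 = 1$; none of these choices comes from the paper, and (K$_0$) is taken here simply to mean that the series defining $t_k$ converges.

```python
def alpha(k):
    """Hypothetical extrapolation coefficient, nonnegative for k >= 0."""
    return k / (k + 3)

def t_closed_form(k):
    """Closed form of t_k for alpha_k = k / (k + 3), obtained by telescoping."""
    return (k + 2) / 2

def t_truncated(k, n_terms=20000):
    """t_k from its definition, with the infinite sum truncated after n_terms terms."""
    total, product = 1.0, 1.0
    for i in range(k, k + n_terms):
        product *= alpha(i)      # running product alpha_k * ... * alpha_i
        total += product
    return total

def omega(k):
    """Hypothetical perturbation with sum of t_{k+1} * omega_k finite."""
    return 1.0 / (k + 1) ** 3

n = 1000
a = [1.0]
for k in range(n):
    a.append(alpha(k) * a[k] + omega(k))   # equality case of inequality (51)

lhs = sum(a[: n + 1])                      # sum_{k=0}^{n} a_k
rhs = t_closed_form(0) * a[0] + sum(
    t_closed_form(k + 1) * omega(k) for k in range(n + 1)
)
print(f"sum of a_k = {lhs:.6f} <= bound = {rhs:.6f}")
```

Taking equality in (51) produces the largest sequence compatible with the assumption, so the gap between the two printed numbers is exactly the discarded term $t_{n+1}a_{n+1}$.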

Lemma 7

[8, Lemma 5.14] Let $(a_k)$ be a sequence of nonnegative numbers such that, for all $k\in \mathbb {N}$, ${a_{k}^{2}} \leq c^{2} + {\sum }_{j=1}^{k} b_{j} a_{j}$, where $(b_j)$ is a summable sequence of nonnegative numbers, and $c \geq 0$. Then, for all $k\in \mathbb {N}$, $a_{k} \leq c + {\sum }_{j=1}^{\infty } b_{j}$.
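To see the bound of Lemma 7 at work, one can build the extremal sequence that satisfies the hypothesis with equality and compare it with $c + \sum_j b_j$. The sketch below assumes the hypothetical data $c = 1$ and $b_j = 1/j^2$ (so that $\sum_j b_j = \pi^2/6$), indexes the sequence from $k = 1$, and obtains each $a_k$ as the nonnegative root of $a^2 = c^2 + \sum_{j<k} b_j a_j + b_k a$.

```python
import math

# Hypothetical data (not from the paper): c = 1 and b_j = 1 / j**2.
c = 1.0
n = 10000
b = [1.0 / j**2 for j in range(1, n + 1)]

a = []
weighted_sum = 0.0   # running value of sum_{j<k} b_j * a_j
for b_k in b:
    s = c**2 + weighted_sum
    # Nonnegative root of a**2 - b_k * a - s = 0, i.e. equality in the hypothesis.
    a_k = (b_k + math.sqrt(b_k**2 + 4.0 * s)) / 2.0
    a.append(a_k)
    weighted_sum += b_k * a_k

bound = c + math.pi**2 / 6   # c + sum_{j>=1} b_j
print(f"max a_k = {max(a):.6f} <= bound = {bound:.6f}")
```

Any sequence satisfying the inequality of the lemma is covered by the same bound; the equality case is used here only because it is easy to generate.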

About this article

Cite this article

Attouch, H., Chbani, Z. & Riahi, H. Convergence Rate of Inertial Proximal Algorithms with General Extrapolation and Proximal Coefficients. Vietnam J. Math. 48, 247–276 (2020). https://doi.org/10.1007/s10013-020-00399-y

Received: 16 April 2019
Accepted: 07 October 2019
Published: 29 April 2020
Issue Date: June 2020
DOI: https://doi.org/10.1007/s10013-020-00399-y
