Recently, it was posited that disparate optimization algorithms may be coalesced in terms of a central source emanating from optimal control theory. Here we further this proposition by showing how coordinate descent algorithms may be derived from this emerging new principle. In particular, we show that basic coordinate descent algorithms can be derived using a maximum principle and a collection of max functions as “control” Lyapunov functions. The convergence of the resulting coordinate descent algorithms is thus connected to the controlled dissipation of their corresponding Lyapunov functions. The operational metric for the search vector in all cases is given by the Hessian of the convex objective function.
This is a preview of subscription content, log in via an institution to check access.
Access this article Subscribe and saveSpringer+ Basic
€34.99 /Month
Price includes VAT (Germany)
Instant access to the full article PDF.
Explore related subjectsDiscover the latest articles and news from researchers in related subjects, suggested using machine learning. Data AvailabilityMy manuscript has no associated data.
ReferencesRoss IM (2019) An optimal control theory for nonlinear optimization. J Comp Appl Math 354:39–51
Vinter RB (2000) Optimal Control. Birkhäuser, Boston, MA
Bazaraa MS, Sherali HD, Shetty CM (2006) Nonlinear Programming: Theory and Algorithms, 3rd edn. Wiley-Interscience, Hoboken, NJ, pp 367–369
Sontag ED, Sussmann HJ (1996) General classes of control-Lyapunov functions. ISNM International Series of Numerical Mathematics, vol 121. Birkhäuser Basel, pp 87–97
Polyak B, Shcherbakov P (2017) Lyapunov functions: an optimization theory perspective. IFAC PapersOnLine 50–1:7456–7461
Wright SJ (2015) Coordinate descent algorithms. Math Program 151(1):3–34
Bottou L, Curtis FE, Nocedal J (2018) Optimization methods for large-scale machine learning. SIAM Rev 60(2):223–311
Nesterov Yu (2012) Efficiency of coordinate descent methods on huge-scale optimization problems. SIAM J. Opt. 22(2):341–362
Bengio Y, LeCun Y, Hinton G (2015) Deep Learning. Nature 521:436–444
Thibault J-B, Sauer KD, Bouman CA, Hsieh J (2007) A three-dimensional statistical approach to improved image quality for multislice helical CT. Med Phys 34(11):4526–4544
Wang H et al (2022) A comprehensive survey on training acceleration for large machine learning models in IoT. IEEE Internet Things J 9(2):939–963
Ifrim G, Carsten W (2011) Bounded coordinate-descent for biological sequence classification in high dimensional predictor space. Proceedings in 17th ACM SIGKDD Intl. Conf. on knowledge discovery and data mining, KDD, pp 708–716
Ross IM (2015) A Primer on Pontryagin’s Principle in Optimal Control, 2nd edn. Collegiate Publishers, San Francisco, CA
Goh BS (1997) Algorithms for unconstrained optimization via control theory. J Optim Theory Appl 92(3):581–604
Lee MS, Harno HG, Goh BS, Lim KH (2021) On the bang-bang control approach via a component-wise line search strategy for unconstrained optimization. Numerical Algebra, Control and Optimization 11(1):45–61
Clarke F (2004) Lyapunov functions and feedback in nonlinear control. In: de Queiroz MS, Malisoff M, Wolenski P (eds) Optimal control, stabilization and nonsmooth analysis. Lecture Notes in Control and Information Science, vol 301. Springer, Berlin, Heidelberg, pp 267–282
Bhaya A, Kaszkurewicz E (2006) Control Perspectives on Numerical Algorithms and Matrix Problems. SIAM, Philadelphia, PA
Nocedal J, Wright S (2006) Numerical Optimization, 2nd edn. Springer, New York, NY
Hiriart-Urruty J-B, Lemaréchal C (1996) Convex Analysis and Minimization Algorithms I. Springer-Verlag, Berlin Heidelberg GmbH
Borwein J, Lewis A (2006) Convex Analysis and Nonlinear Optimization: Theory and Examples, 2nd edn. Springer, New York, N.Y
Boyd S, Vandenberghe L (2009) Convex Optimization. Cambridge University Press, Cambridge, UK
Davidon WC (1991) Variable metric method for minimization, SIAM J. optimization, 1/1, pp. 1–17 (originally published as Argonne National Laboratory Research and Development Report 5990, May 1959; revised November 1959)
Funding for this research was provided by the Air Force Office of Scientific Research.
Author information Authors and AffiliationsControl & Optimization Laboratories, Naval Postgraduate School, 1 University Circle, Monterey, CA, 93943, USA
Isaac M. Ross
Correspondence to Isaac M. Ross.
Ethics declarations Conflict of InterestThe author declares no competing interests.
Additional information Publisher’s NoteSpringer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article Cite this articleRoss, I. Derivation of Coordinate Descent Algorithms from Optimal Control Theory. Oper. Res. Forum 4, 31 (2023). https://doi.org/10.1007/s43069-023-00215-6
Received: 26 January 2022
Accepted: 02 March 2023
Published: 31 March 2023
DOI: https://doi.org/10.1007/s43069-023-00215-6
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4