Global linear convergence of evolution strategies with recombination on scaling-invariant functions

Evolution Strategies (ESs) are stochastic derivative-free optimization algorithms whose most prominent representative, the CMA-ES algorithm, is widely used to solve difficult numerical optimization problems. We provide the first rigorous investigation of the linear convergence of step-size adaptive...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Journal of global optimization 2023-05, Vol.86 (1), p.163-203
Hauptverfasser:	Toure, Cheikh, Auger, Anne, Hansen, Nikolaus
Format:	Artikel
Sprache:	eng
Schlagworte:	Adaptation Algorithms Computer Science Convergence Divergence Evolution Invariants Linear functions Markov analysis Markov chains Markov processes Mathematical optimization Mathematics Mathematics and Statistics Normal distribution Operations Research/Decision Theory Optimization Optimization algorithms Optimization and Control Probability Real Functions Robustness (mathematics) Stochastic models
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Evolution Strategies (ESs) are stochastic derivative-free optimization algorithms whose most prominent representative, the CMA-ES algorithm, is widely used to solve difficult numerical optimization problems. We provide the first rigorous investigation of the linear convergence of step-size adaptive ESs involving a population and recombination, two ingredients crucially important in practice to be robust to local irregularities or multimodality. We investigate the convergence of step-size adaptive ESs with weighted recombination on composites of strictly increasing functions with continuously differentiable scaling-invariant functions with a global optimum. This function class includes functions with non-convex sublevel sets and discontinuous functions. We prove the existence of a constant r such that the logarithm of the distance to the optimum divided by the number of iterations converges to r . The constant is given as an expectation with respect to the stationary distribution of a Markov chain—its sign allows to infer linear convergence or divergence of the ES and is found numerically. Our main condition for convergence is the increase of the expected log step-size on linear functions. In contrast to previous results, our condition is equivalent to the almost sure geometric divergence of the step-size on linear functions.
ISSN:	0925-5001 1573-2916
DOI:	10.1007/s10898-022-01249-6