Stochastic Frank-Wolfe: Unified Analysis and Zoo of Special Cases
The Conditional Gradient (or Frank-Wolfe) method is one of the most well-known methods for solving constrained optimization problems appearing in various machine learning tasks. The simplicity of iteration and applicability to many practical problems helped the method to gain popularity in the commu...
Gespeichert in:
Hauptverfasser: | , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The Conditional Gradient (or Frank-Wolfe) method is one of the most
well-known methods for solving constrained optimization problems appearing in
various machine learning tasks. The simplicity of iteration and applicability
to many practical problems helped the method to gain popularity in the
community. In recent years, the Frank-Wolfe algorithm received many different
extensions, including stochastic modifications with variance reduction and
coordinate sampling for training of huge models or distributed variants for big
data problems. In this paper, we present a unified convergence analysis of the
Stochastic Frank-Wolfe method that covers a large number of particular
practical cases that may have completely different nature of stochasticity,
intuitions and application areas. Our analysis is based on a key parametric
assumption on the variance of the stochastic gradients. But unlike most works
on unified analysis of other methods, such as SGD, we do not assume an
unbiasedness of the real gradient estimation. We conduct analysis for convex
and non-convex problems due to the popularity of both cases in machine
learning. With this general theoretical framework, we not only cover rates of
many known methods, but also develop numerous new methods. This shows the
flexibility of our approach in developing new algorithms based on the
Conditional Gradient approach. We also demonstrate the properties of the new
methods through numerical experiments. |
---|---|
DOI: | 10.48550/arxiv.2406.06788 |