
\begin{abstract}%   <- trailing '%' for backward compatibility of .sty file

We propose a remarkably general variance-reduced method suitable for solving regularized empirical risk minimization problems with a large number of training examples, a large model dimension, or both. In special cases, our method reduces to several known methods previously thought to be unrelated, such as {\tt SAGA}~\cite{SAGA}, {\tt LSVRG}~\cite{hofmann2015variance, LSVRG}, {\tt JacSketch}~\cite{gower2018stochastic}, {\tt SEGA}~\cite{hanzely2018sega} and {\tt ISEGA}~\cite{mishchenko201999}, as well as their arbitrary sampling and proximal generalizations. We also highlight a large number of new specific algorithms with interesting properties. We provide a single theorem establishing linear convergence of the method under smoothness and quasi strong convexity assumptions. With this theorem, we recover the best-known, and sometimes improved, rates for the known methods arising as special cases. As a by-product, we provide the first unified method and theory for stochastic gradient and stochastic coordinate descent type methods.

\end{abstract}
