89adc12c82f96911.tex
1: \begin{abstract}
2: Variance reduced stochastic gradient methods have gained popularity in recent times.
3: Several variants exist with different strategies for the storing and sampling of gradients.
4: In this work we focus on the analysis of the interaction of these two aspects.
5: We present and analyze a general proximal variance reduced gradient method under strong convexity assumptions.
6: Special cases of the algorithm include SAGA, L-SVRG and their proximal variants.
7: Our analysis sheds light on epoch-length selection and the need to balance the convergence of the iterates and how often gradients are stored.
8: The analysis improves on other convergence rates found in literature and produces a new and faster converging sampling strategy for SAGA.
9: Problem instances for which the predicted rates are the same as the practical rates are presented together with problems based on real world data.
10: \end{abstract}
11: