abstract:d53c2436a317d02b.tex

1: \begin{abstract}

2:   We study without-replacement SGD for solving finite-sum optimization problems.

3:   Specifically, depending on how the indices of the finite-sum are shuffled, we consider the $\randshuf$ (shuffle at the beginning of each epoch) and $\singshuf$ (shuffle only once) algorithms.

4:   First, we establish minimax optimal convergence rates of these algorithms up to poly-log factors.

5:   Notably, our analysis is general enough to cover gradient dominated \emph{nonconvex} costs, and does not rely on the convexity of individual component functions unlike existing optimal convergence results.

6:   Secondly, assuming convexity of the individual components, we further sharpen the tight convergence results for $\randshuf$ by removing the drawbacks common to all prior arts: large number of epochs required for the results to hold, and extra poly-log factor gaps to the lower bound.

7: \end{abstract}

8: