\begin{abstract}
  We study without-replacement SGD for solving finite-sum optimization problems.
  Specifically, depending on how the indices of the finite sum are shuffled, we consider the $\randshuf$ (shuffle at the beginning of each epoch) and $\singshuf$ (shuffle only once) algorithms.
  First, we establish minimax optimal convergence rates of these algorithms up to poly-log factors.
  Notably, our analysis is general enough to cover gradient-dominated \emph{nonconvex} costs and, unlike existing optimal convergence results, does not rely on the convexity of individual component functions.
  Second, assuming convexity of the individual components, we further sharpen the tight convergence results for $\randshuf$ by removing two drawbacks common to all prior work: the large number of epochs required for the results to hold, and the extra poly-log factor gaps to the lower bound.
\end{abstract}
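
For concreteness, the two shuffling schemes can be sketched as follows. The notation used here ($F$, $f_i$, $n$, $\eta$, $\sigma_k$, $x_i^k$) and the constant step size are assumptions made for illustration and need not match the setup analyzed in the paper. Suppose the goal is to minimize $F(x) = \frac{1}{n}\sum_{i=1}^{n} f_i(x)$. In epoch $k$, without-replacement SGD visits the components in the order given by a permutation $\sigma_k$ of $\{1,\dots,n\}$:
\[
  x_i^k = x_{i-1}^k - \eta \, \nabla f_{\sigma_k(i)}\bigl(x_{i-1}^k\bigr), \quad i = 1,\dots,n,
  \qquad x_0^{k+1} = x_n^k .
\]
Under this sketch, $\randshuf$ draws a fresh uniformly random $\sigma_k$ at the start of every epoch, whereas $\singshuf$ draws $\sigma_1$ once and reuses it, so that $\sigma_k = \sigma_1$ for all $k$.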