1: \begin{abstract}
2: Stochastic Gradient Descent (SGD) has become popular for solving
3: large scale supervised machine learning optimization problems such
4: as SVM, due to their strong theoretical guarantees. While the
5: closely related Dual Coordinate Ascent (DCA) method has been
6: implemented in various software packages, it has so far lacked good
7: convergence analysis. This paper presents a new analysis of
8: Stochastic Dual Coordinate Ascent (SDCA) showing that this class of
9: methods enjoy strong theoretical guarantees that are comparable or
10: better than SGD. This analysis justifies the effectiveness of SDCA
11: for practical applications.
12: \end{abstract}
13: