5ddfdeaef2f15df3.tex
1: \begin{abstract}
2:   Stochastic Gradient Descent (SGD) has become popular for solving
3:   large scale supervised machine learning optimization problems such
4:   as SVM, due to their strong theoretical guarantees.  While the
5:   closely related Dual Coordinate Ascent (DCA) method has been
6:   implemented in various software packages, it has so far lacked good
7:   convergence analysis.  This paper presents a new analysis of
8:   Stochastic Dual Coordinate Ascent (SDCA) showing that this class of
9:   methods enjoy strong theoretical guarantees that are comparable or
10:   better than SGD. This analysis justifies the effectiveness of SDCA
11:   for practical applications.
12: \end{abstract}
13: