1: \begin{abstract}
2: With the recent proliferation of large-scale learning problems, there have been a lot of
3: interest on distributed machine learning
4: algorithms, particularly those that are
5: based on stochastic gradient descent (SGD) and its variants.
6: However, existing
7: algorithms either suffer from slow convergence
8: due to the inherent variance of stochastic gradients,
9: or have a fast linear convergence rate
10: but at the expense of poorer solution quality.
11: In this paper, we combine their merits by
12: proposing a
13: fast
14: distributed asynchronous SGD-based algorithm
15: with variance reduction.
16: A constant learning rate
17: can be used,
18: and it is also guaranteed
19: to converge linearly to the optimal solution. Experiments on the Google Cloud Computing
20: Platform demonstrate that the proposed algorithm
21: outperforms state-of-the-art
22: distributed asynchronous algorithms
23: in terms of both wall clock time and solution quality.
24: \end{abstract}
25: