abstract:e3a922d60dc390e0.tex

1: \begin{abstract}

2: The implementation of a vast majority of machine learning (ML) algorithms boils down

3: to solving a numerical optimization problem. In this context, Stochastic

4: Gradient Descent (SGD) methods have long proven to provide good results, both

5: in terms of convergence and accuracy. Recently, several parallelization approaches

6: have been proposed in order to scale SGD to solve very large ML problems.

7: At their core, most of these approaches are following a MapReduce scheme.\\

8: This paper presents a novel parallel updating algorithm for SGD, which utilizes

9: the asynchronous single-sided communication paradigm.

10: Compared to existing methods, Asynchronous Parallel Stochastic Gradient Descent

11: (ASGD) provides faster convergence,

12: at linear scalability and stable accuracy.

13: \end{abstract}