e3a922d60dc390e0.tex
1: \begin{abstract}
2: The implementation of a vast majority of machine learning (ML) algorithms boils down
3: to solving a numerical optimization problem. In this context, Stochastic 
4: Gradient Descent (SGD) methods have long proven to provide good results, both
5: in terms of convergence and accuracy. Recently, several parallelization approaches
6: have been proposed in order to scale SGD to solve very large ML problems. 
7: At their core, most of these approaches are following a MapReduce scheme.\\
8: This paper presents a novel parallel updating algorithm for SGD, which utilizes
9: the asynchronous single-sided communication paradigm.  
10: Compared to existing methods, Asynchronous Parallel Stochastic Gradient Descent
11: (ASGD) provides faster convergence, 
12: at linear scalability and stable accuracy.   
13: \end{abstract}