f954fd18b6cddfa9.tex
1: \begin{abstract}%
2: As data sets continue to increase in size and multi-core computer architectures are developed, asynchronous parallel optimization algorithms become more and more essential to the field of Machine Learning.
3: Unfortunately, conducting the theoretical analysis asynchronous methods is difficult, notably due to the introduction of delay and inconsistency in inherently sequential algorithms. 
4: Handling these issues often requires resorting to simplifying but unrealistic assumptions. 
5: Through a novel perspective, we revisit and clarify a subtle but important technical issue present in a large fraction of the recent convergence rate proofs for asynchronous parallel optimization algorithms, and propose a simplification of the recently introduced ``perturbed iterate'' framework that resolves it.
6: We demonstrate the usefulness of our new framework by analyzing three distinct asynchronous parallel incremental optimization algorithms: \Hogwild\ (asynchronous \SGD), \KROMAGNON\ (asynchronous \SVRG) and \ASAGA, a novel asynchronous parallel version of the incremental gradient algorithm \SAGA\ that enjoys fast linear convergence rates.
7: We are able to both remove problematic assumptions and obtain better theoretical results.
8: Notably, we prove that \ASAGA\ and \KROMAGNON\ can obtain a theoretical linear speedup on multi-core systems even without sparsity assumptions.
9: We present results of an implementation on a 40-core architecture illustrating the practical speedups as well as the hardware overhead.
10: Finally, we investigate the overlap constant, an ill-understood but central quantity for the theoretical analysis of asynchronous parallel algorithms.
11: We find that it encompasses much more complexity than suggested in previous work, and often is order-of-magnitude bigger than traditionally thought.
12: \end{abstract}
13: