fdeb144370e1445b.tex
1: \begin{abstract}  
2: Distributed optimization methods for large-scale machine learning suffer 
3: from a communication bottleneck. It is difficult to reduce this bottleneck while still efficiently and accurately aggregating partial work from different machines. 
4: In this paper, we present a novel generalization of the recent communication-efficient primal-dual framework (\cocoa) for distributed optimization. 
5: Our framework, \cocoap, allows for \emph{additive} combination of local updates 
6: to the global parameters at each iteration, whereas previous schemes with convergence guarantees only 
7: allow conservative averaging. 
8: %
9: We give stronger (primal-dual) convergence 
10: rate guarantees for both \cocoa as well as our new variants, and generalize 
11: the theory for both methods to cover non-smooth convex loss functions. 
12: %
13: We provide an extensive experimental comparison that shows the markedly improved performance of \cocoap on several real-world distributed datasets, especially when scaling up the number of machines.
14: \end{abstract}