abstract:fdeb144370e1445b.tex

1: \begin{abstract}

2: Distributed optimization methods for large-scale machine learning suffer

3: from a communication bottleneck. It is difficult to reduce this bottleneck while still efficiently and accurately aggregating partial work from different machines.

4: In this paper, we present a novel generalization of the recent communication-efficient primal-dual framework (\cocoa) for distributed optimization.

5: Our framework, \cocoap, allows for \emph{additive} combination of local updates

6: to the global parameters at each iteration, whereas previous schemes with convergence guarantees only

7: allow conservative averaging.

8: %

9: We give stronger (primal-dual) convergence

10: rate guarantees for both \cocoa as well as our new variants, and generalize

11: the theory for both methods to cover non-smooth convex loss functions.

12: %

13: We provide an extensive experimental comparison that shows the markedly improved performance of \cocoap on several real-world distributed datasets, especially when scaling up the number of machines.

14: \end{abstract}