abstract:1409704547cf1257.tex

1: \begin{abstract}

2: We consider distributed optimization where $N$ nodes in a connected network minimize

3: the sum of their local costs subject to a common constraint set.

4: We propose a distributed projected gradient method where each node, at each iteration~$k$,

5: performs an update (is active) with probability~$p_k$,

6: and stays idle (is inactive) with probability~$1-p_k$. Whenever active, each node performs an update by

7:  weight-averaging its solution estimate with the estimates of its active neighbors,

8: taking a negative gradient step with respect to its local cost, and performing a projection

9: onto the constraint set; inactive nodes perform no updates.

10: Assuming that nodes' local costs are strongly convex, with Lipschitz continuous gradients, we show that, as long as

11: activation probability $p_k$ grows to one asymptotically,

12: our algorithm converges in the mean square sense (MSS) to the same solution as the standard distributed gradient method,

13: i.e., as if all the nodes were active at all iterations. Moreover, when $p_k$ grows to one linearly,

14: with an appropriately set convergence factor,

15: the algorithm has a linear MSS convergence, with practically the same factor as the standard distributed gradient method.

16: Simulations on both synthetic and real world data sets

17: demonstrate that, when compared with the standard distributed gradient method,

18:  the proposed algorithm significantly reduces

19: the overall number of per-node communications and per-node gradient evaluations (computational cost)

20:  for the same required accuracy.

21: \end{abstract}

22: