1: \begin{abstract}
2: We consider distributed optimization where $N$ nodes in a connected network minimize
3: the sum of their local costs subject to a common constraint set.
4: We propose a distributed projected gradient method where each node, at each iteration~$k$,
5: performs an update (is active) with probability~$p_k$,
6: and stays idle (is inactive) with probability~$1-p_k$. Whenever active, each node performs an update by
7: weight-averaging its solution estimate with the estimates of its active neighbors,
8: taking a negative gradient step with respect to its local cost, and performing a projection
9: onto the constraint set; inactive nodes perform no updates.
10: Assuming that nodes' local costs are strongly convex, with Lipschitz continuous gradients, we show that, as long as
11: activation probability $p_k$ grows to one asymptotically,
12: our algorithm converges in the mean square sense (MSS) to the same solution as the standard distributed gradient method,
13: i.e., as if all the nodes were active at all iterations. Moreover, when $p_k$ grows to one linearly,
14: with an appropriately set convergence factor,
15: the algorithm has a linear MSS convergence, with practically the same factor as the standard distributed gradient method.
16: Simulations on both synthetic and real world data sets
17: demonstrate that, when compared with the standard distributed gradient method,
18: the proposed algorithm significantly reduces
19: the overall number of per-node communications and per-node gradient evaluations (computational cost)
20: for the same required accuracy.
21: \end{abstract}
22: