\begin{abstract}
\Ac{DL} offers a powerful framework where nodes collaboratively train models without sharing raw data and without the coordination of a central server.
In each round of \ac{DL}, nodes train their models locally, share them with their neighbors in the topology, and aggregate them with the models received from those neighbors.
Sharing and merging models contribute to convergence towards a consensus model that generalizes better across the collective data captured at training time.
In addition, the energy consumed by sharing and merging model parameters is negligible compared to the energy spent on training.
Leveraging this fact, we present \sys, a novel \ac{DL} algorithm that minimizes energy consumption by strategically \emph{skipping} some training rounds and substituting them with \emph{synchronization} rounds.
Besides saving energy, these training-silent periods allow models to mix better and ultimately yield higher accuracy than typical \ac{DL} algorithms that train at every round.
Our empirical evaluations with 256 nodes demonstrate that \sys reduces energy consumption by 50\% and increases model accuracy by up to 12\% compared to D-PSGD, the conventional \ac{DL} algorithm.
\end{abstract}