\begin{abstract}
\Ac{DL} offers a powerful framework where nodes collaboratively train models without sharing raw data and without the coordination of a central server.
In each iterative round of \ac{DL}, models are trained locally, shared with neighbors in the topology, and aggregated with the models received from those neighbors.
Sharing and merging models contribute to convergence towards a consensus model that generalizes better across the collective data captured at training time.
In addition, the energy consumed by sharing and merging model parameters is negligible compared to the energy spent during the training phase.
Leveraging this fact, we present \sys, a novel \ac{DL} algorithm that reduces energy consumption by strategically \emph{skipping} some training rounds and substituting them with \emph{synchronization} rounds.
These training-silent periods not only save energy but also allow models to mix better, ultimately yielding higher accuracy than typical \ac{DL} algorithms that train at every round.
Our empirical evaluations with 256 nodes demonstrate that \sys reduces energy consumption by 50\% and increases model accuracy by up to 12\% compared to D-PSGD, the conventional \ac{DL} algorithm.
\end{abstract}