abstract:13e82223fefcfb33.tex

1: \begin{abstract}

2:   \revision{Byzantine}-robustness has been gaining a lot of attention due to the growth of the interest in collaborative and federated learning. However, many fruitful directions, such as the usage of variance reduction for achieving robustness and communication compression for reducing communication costs, remain weakly explored in the field. This work addresses this gap and proposes \algname{Byz-VR-MARINA}--a new \revision{Byzantine}-tolerant method with variance reduction and compression. A key message of our paper is that variance reduction is key to fighting \revision{Byzantine} workers more effectively. At the same time, communication compression is a bonus that makes the process more communication efficient. We derive theoretical convergence guarantees for \algname{Byz-VR-MARINA} outperforming previous state-of-the-art for general non-convex and Polyak-{\L}ojasiewicz loss functions. Unlike the concurrent \revision{Byzantine}-robust methods with variance reduction and/or compression, our complexity results are tight and do not rely on restrictive assumptions such as boundedness of the gradients or limited compression. Moreover, we provide the first analysis of a \revision{Byzantine}-tolerant method supporting non-uniform sampling of stochastic gradients. Numerical experiments corroborate our theoretical findings.

3: \end{abstract}

4: