5d80b0effe9e631b.tex
1: \begin{abstract}
2: Federated Learning is a distributed learning paradigm with two key challenges that differentiate it from traditional distributed optimization: (1) significant variability in terms of the systems characteristics on each device in the network (systems heterogeneity), and (2) non-identically distributed data across the network (statistical heterogeneity).   
3: In this work, we introduce a framework, \fedprox, to tackle heterogeneity in federated networks. \fedprox can be viewed as a generalization and re-parametrization of \fedavg, the current state-of-the-art method for federated learning. While \fedprox makes only minor algorithmic modifications to \fedavg, these modifications have important ramifications both in theory and in practice. 
4: Theoretically, we provide convergence guarantees for our framework when learning over data from non-identical distributions (statistical heterogeneity), and while adhering to  device-level systems constraints by allowing each participating device to perform a variable amount of work (systems heterogeneity). 
5: Practically, we demonstrate that \fedprox allows for more robust convergence than \fedavg across a suite of federated datasets. In particular, in highly heterogeneous settings, \fedprox demonstrates significantly more stable and accurate convergence behavior relative to \fedavg---improving absolute test accuracy by 22\% on average. 
6: \end{abstract}
7: