5d6e3e44e6326dcb.tex
1: \begin{abstract}
2:     Federated learning (FL), which has gained increasing attention recently, enables distributed devices to train a common machine learning (ML) model for intelligent inference cooperatively without data sharing.
3:     However, problems in practical networks, such as non-independent-and-identically-distributed (non-iid) raw data and limited bandwidth, give rise to slow and unstable convergence of the FL training process.
4:     To address these issues, we propose a new FL method that can significantly mitigate statistical heterogeneity through the \emph{depersonalization mechanism.}
5:     Particularly, we decouple the global and local optimization objectives by alternating stochastic gradient descent, thus reducing the accumulated variance in local update phases to accelerate the FL convergence.
6:     Then we analyze the proposed method in detail to show the proposed method converging at a sublinear speed in the general non-convex setting.
7:     Finally, numerical results are conducted with experiments on public datasets to verify the effectiveness of our proposed method.
8: \end{abstract}
9: