1: \begin{abstract}
2: Federated learning allows multiple clients to collaboratively train a model without exchanging their data, thus preserving data privacy.
3: Unfortunately, it suffers significant performance degradation due to heterogeneous data at clients.
4: Common solutions involve designing an auxiliary loss to regularize weight divergence or feature inconsistency during local training.
5: However, we discover that these approaches fall short of the expected performance because they ignore the existence of a \textit{vicious cycle} between feature inconsistency and classifier divergence across clients.
6: This \textit{vicious cycle} causes client models to be updated in inconsistent feature spaces with more diverged classifiers.
7: To break the \textit{vicious cycle}, we propose a novel framework named \textit{\textbf{Fed}erated learning with \textbf{F}eature \textbf{A}nchors} (FedFA).
8: FedFA utilizes feature anchors to align features and calibrate classifiers across clients simultaneously.
9: This enables client models to be updated in a shared feature space with consistent classifiers during local training.
10: Theoretically, we analyze the non-convex convergence rate of FedFA.
11: We also demonstrate that the integration of feature alignment and classifier calibration in FedFA brings a \textit{virtuous cycle} between feature and classifier updates, which breaks the \textit{vicious cycle} existing in current approaches.
12: Extensive experiments show that FedFA significantly outperforms existing approaches on various classification datasets under label distribution skew and feature distribution skew.
13: \end{abstract}
14: