abstract:8b7286dd7f2c6301.tex

1: \begin{abstract}

2: \thispagestyle{empty}

3: {

4: Federated learning (FL) is a promising approach that enables distributed clients to collaboratively train a global model while preserving their data privacy.

5: However, FL often suffers from data heterogeneity problems, which can significantly affect its performance.

6: To address this, clustered federated learning (CFL) has been proposed to construct personalized models for different client clusters.

7: One effective client clustering strategy is to allow clients to choose their own local models from a model pool based on their performance.

8: However, without pre-trained model parameters, such a strategy is prone to clustering failure, in which all clients choose the same model.

9: Unfortunately, collecting a large amount of labeled data for pre-training can be costly and impractical in distributed environments.

10: To overcome this challenge, we leverage self-supervised contrastive learning to exploit unlabeled data for the pre-training of FL systems.

11: Together, self-supervised pre-training and client clustering can be crucial components for tackling the data heterogeneity issues of FL.

12: Leveraging these two crucial strategies, we propose contrastive pre-training--based clustered federated learning (CP-CFL) to improve the model convergence and overall performance of FL systems.

13: In this work, we demonstrate the effectiveness of CP-CFL through extensive experiments in heterogeneous FL settings, and present various interesting observations.

14: }

15:

16: \end{abstract}

17: