abstract:cd6c1f5f8e2e348e.tex

1: \begin{abstract}

2: % previous works

3: %To address the challenge of data heterogeneity in cross-device federated learning (FL), existing approaches have primarily relied on either leveraging public data on the server or generating synthetic data during FL training to facilitate model aggregation. However, these methods are often hindered by limitations associated with the quality of external datasets or the specific tasks they are designed for, rendering them impractical unless certain prerequisites are met.

4: % the proposed work

5: In this work, we propose \texttt{GPT-FL}, a generative pre-trained model-assisted federated learning (FL) framework. At its core, \texttt{GPT-FL} leverages generative pre-trained models to generate diversified synthetic data. These generated data are used to train a downstream model on the server, which is then fine-tuned with private client data under the standard FL framework.

6: %

7: % experiment part

8: We show that \texttt{GPT-FL} consistently outperforms state-of-the-art FL methods in terms of model test accuracy, communication efficiency, and client sampling efficiency.

9: %

10: % In addition, we conduct a comprehensive ablation analysis to elucidate the advantages of \texttt{GPT-FL} and evaluate its performance on benchmark datasets from various modalities, underscoring its versatility and practicality for FL.

11: Through comprehensive ablation analysis, we discover that the downstream model generated by synthetic data plays a crucial role in controlling the direction of gradient diversity during FL training, which enhances convergence speed and contributes to the notable accuracy boost observed with \texttt{GPT-FL}. Also, regardless of whether the target data falls within or outside the domain of the pre-trained generative model, \texttt{GPT-FL} consistently achieves significant performance gains, surpassing the results obtained by models trained solely with FL or synthetic data.

12: %, underscoring its versatility and practicality.

13:

14:

15:

16:

17:

18: \end{abstract}

19: