\begin{abstract}
    This study develops a federated learning (FL) framework
    that overcomes the communication costs of typical frameworks, which grow substantially with model size, without compromising model performance.
    To this end, leveraging an unlabeled open dataset,
    we propose a distillation-based semi-supervised FL (DS-FL) algorithm in which mobile devices exchange the outputs of their local models
    instead of the model parameters exchanged in typical frameworks.
    In the proposed DS-FL, the communication cost depends only on the output dimensions of the models and does not scale with the model size.
    The exchanged model outputs are used to label each sample of the open dataset, creating an additional labeled dataset.
    This newly labeled dataset is used to further train the local models,
    and model performance improves owing to the data augmentation effect.
    We further highlight that, in the proposed DS-FL,
    heterogeneity among the devices' datasets makes the aggregated model outputs for each data sample ambiguous, which slows training convergence.
    To prevent this, we propose entropy reduction averaging, in which the aggregated model outputs are intentionally sharpened.
    Moreover, extensive experiments show that DS-FL reduces communication costs by up to 99\% relative to those of the FL benchmark while achieving similar or higher classification accuracy.
\end{abstract}