1: \begin{abstract}
2: This study develops a federated learning (FL) framework
3: overcoming largely incremental communication costs due to model sizes in typical frameworks without compromising model performance.
4: To this end, based on the idea of leveraging an unlabeled open dataset,
5: we propose a distillation-based semi-supervised FL (DS-FL) algorithm that exchanges the outputs of local models among mobile devices,
6: instead of the model parameter exchange employed by typical frameworks.
7: In the proposed DS-FL, the communication cost relies only on the output dimensions of the models and does not scale up according to the model size.
8: The exchanged model outputs are used to label each sample of the open dataset, creating an additionally labeled dataset.
9: The newly labeled dataset is used for further training the local models,
10: and model performance is enhanced owing to the data augmentation effect.
11: We further highlight that in the proposed DS-FL,
12: the heterogeneity of the devices' dataset leads to the ambiguity of each data sample, lowering the training convergence.
13: To prevent this, we propose entropy reduction averaging, where the aggregated model outputs are intentionally sharpened.
14: Moreover, the extensive experiments conducted show that DS-FL reduces the communication costs up to 99\% relative to those of the FL benchmark while achieving similar or higher classification accuracy.
15: \end{abstract}
16: