\begin{abstract}
    This study develops a federated learning (FL) framework
    that avoids the communication costs of typical frameworks, which grow with model size, without compromising model performance.
    To this end, we leverage an unlabeled open dataset and
    propose a distillation-based semi-supervised FL (DS-FL) algorithm in which mobile devices exchange the outputs of their local models
    instead of the model parameters exchanged in typical frameworks.
    In the proposed DS-FL, the communication cost depends only on the output dimensions of the models and does not scale with the model size.
    The exchanged model outputs are used to label each sample of the open dataset, creating an additional labeled dataset.
    The newly labeled dataset is used to further train the local models,
    and model performance improves owing to the data augmentation effect.
    We further highlight that in the proposed DS-FL,
    the heterogeneity of the devices' datasets makes the aggregated outputs for each data sample ambiguous, which slows training convergence.
    To prevent this, we propose entropy reduction averaging, in which the aggregated model outputs are intentionally sharpened.
    Moreover, extensive experiments show that DS-FL reduces communication costs by up to 99\% relative to those of the FL benchmark while achieving similar or higher classification accuracy.
\end{abstract}
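
The sharpening step can be sketched as follows, under notation that is assumed here for illustration and is not fixed by the abstract: let $\mathcal{K}$ be the set of participating devices, $C$ the number of classes, and $\hat{p}_k(x)$ the $C$-dimensional class-probability output of device $k$ on an open-dataset sample $x$. Plain averaging gives
\[
    \bar{p}(x) = \frac{1}{|\mathcal{K}|} \sum_{k \in \mathcal{K}} \hat{p}_k(x),
\]
and one possible sharpening operator, chosen here as an assumption, is a softmax with temperature $T > 0$:
\[
    \tilde{p}_c(x) = \frac{\exp\bigl(\bar{p}_c(x)/T\bigr)}{\sum_{c'=1}^{C} \exp\bigl(\bar{p}_{c'}(x)/T\bigr)}, \qquad c = 1, \dots, C.
\]
As $T \to 0$, $\tilde{p}(x)$ concentrates its mass on the largest entries of $\bar{p}(x)$, so a sufficiently small $T$ yields a lower-entropy training target than $\bar{p}(x)$ itself. Under the same assumptions, a device that reports outputs on $N_o$ open-dataset samples uploads on the order of $N_o C$ scalars per round, a quantity that does not depend on the number of model parameters.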