abstract:c1c0dd9579f748aa.tex

1: \begin{abstract}

2: Recently, federated learning has emerged as a promising approach for training a global model using data from multiple organizations without leaking their raw data.

3: Nevertheless, directly applying federated learning to real-world tasks faces two challenges: (1) heterogeneity in the data among different organizations; and (2) data noises inside individual organizations.

4:

5: In this paper, we propose a general framework to solve the above two challenges

6: simultaneously.

7: Specifically, we propose using distributionally robust optimization to mitigate the negative effects caused by data heterogeneity paradigm to sample clients based on a learnable distribution at each iteration.

8: Additionally, we observe that this optimization paradigm is easily affected by data noises inside local clients, which has a significant performance degradation in terms of global model prediction accuracy.

9: To solve this problem, we propose to incorporate mixup

10: techniques into the local training process of federated learning. We further provide comprehensive

11: theoretical analysis including robustness analysis, convergence analysis, and generalization ability. Furthermore, we conduct empirical studies across different drug discovery tasks, such as ADMET property prediction and drug-target affinity prediction.

12:

13:

14: \end{abstract}

15: