1: \begin{abstract}
2: Recently, federated learning has emerged as a promising approach for training a global model using data from multiple organizations without leaking their raw data.
3: Nevertheless, directly applying federated learning to real-world tasks faces two challenges: (1) heterogeneity in the data among different organizations; and (2) data noises inside individual organizations.
4:
5: In this paper, we propose a general framework to solve the above two challenges
6: simultaneously.
7: Specifically, we propose using distributionally robust optimization to mitigate the negative effects caused by data heterogeneity paradigm to sample clients based on a learnable distribution at each iteration.
8: Additionally, we observe that this optimization paradigm is easily affected by data noises inside local clients, which has a significant performance degradation in terms of global model prediction accuracy.
9: To solve this problem, we propose to incorporate mixup
10: techniques into the local training process of federated learning. We further provide comprehensive
11: theoretical analysis including robustness analysis, convergence analysis, and generalization ability. Furthermore, we conduct empirical studies across different drug discovery tasks, such as ADMET property prediction and drug-target affinity prediction.
12:
13:
14: \end{abstract}
15: