2236c0c3918cf5f1.tex
1: \begin{abstract}
2: Vertical federated learning (VFL) is an emerging paradigm that 
3: allows different parties (e.g., organizations or enterprises) to 
4: collaboratively build machine learning models with privacy protection. 
5: In the training phase, VFL only exchanges the intermediate statistics, 
6: i.e., forward activations and backward derivatives, 
7: across parties to compute model gradients. 
8: Nevertheless, due to its geo-distributed nature, 
9: VFL training usually suffers from the low WAN bandwidth. 
10: 
11: In this paper, we introduce \system, a novel and efficient VFL training framework 
12: that exploits the local update technique to reduce the cross-party communication rounds. 
13: \system caches the stale statistics and reuses them to estimate model gradients 
14: without exchanging the ad hoc statistics. 
15: Significant techniques are proposed to improve the convergence performance. 
16: First, to handle the stochastic variance problem, 
17: we propose a uniform sampling strategy to 
18: fairly choose the stale statistics for local updates. 
19: Second, to harness the errors brought by the staleness, 
20: we devise an instance weighting mechanism that measures the reliability 
21: of the estimated gradients. 
22: Theoretical analysis proves that \system achieves a similar sub-linear convergence rate 
23: as vanilla VFL training but requires much fewer communication rounds. 
24: Empirical results on both public and real-world workloads 
25: validate that \system can be up to six times faster than the existing works. 
26: \end{abstract}
27: