1: \begin{abstract}
2: Vertical federated learning (VFL) is an emerging paradigm that
3: allows different parties (e.g., organizations or enterprises) to
4: collaboratively build machine learning models with privacy protection.
5: In the training phase, VFL only exchanges the intermediate statistics,
6: i.e., forward activations and backward derivatives,
7: across parties to compute model gradients.
8: Nevertheless, due to its geo-distributed nature,
9: VFL training usually suffers from the low WAN bandwidth.
10:
11: In this paper, we introduce \system, a novel and efficient VFL training framework
12: that exploits the local update technique to reduce the cross-party communication rounds.
13: \system caches the stale statistics and reuses them to estimate model gradients
14: without exchanging the ad hoc statistics.
15: Significant techniques are proposed to improve the convergence performance.
16: First, to handle the stochastic variance problem,
17: we propose a uniform sampling strategy to
18: fairly choose the stale statistics for local updates.
19: Second, to harness the errors brought by the staleness,
20: we devise an instance weighting mechanism that measures the reliability
21: of the estimated gradients.
22: Theoretical analysis proves that \system achieves a similar sub-linear convergence rate
23: as vanilla VFL training but requires much fewer communication rounds.
24: Empirical results on both public and real-world workloads
25: validate that \system can be up to six times faster than the existing works.
26: \end{abstract}
27: