abstract:2236c0c3918cf5f1.tex

1: \begin{abstract}

2: Vertical federated learning (VFL) is an emerging paradigm that

3: allows different parties (e.g., organizations or enterprises) to

4: collaboratively build machine learning models while protecting data privacy.

5: In the training phase, VFL exchanges only intermediate statistics,

6: i.e., forward activations and backward derivatives,

7: across parties to compute model gradients.

8: Nevertheless, due to its geo-distributed nature,

9: VFL training usually suffers from low WAN bandwidth.

10:

11: In this paper, we introduce \system, a novel and efficient VFL training framework

12: that exploits the local update technique to reduce the number of cross-party communication rounds.

13: \system caches the stale statistics and reuses them to estimate model gradients

14: without exchanging up-to-date statistics.

15: We propose two techniques to improve convergence.

16: First, to handle the stochastic variance problem,

17: we propose a uniform sampling strategy to

18: fairly choose the stale statistics for local updates.

19: Second, to mitigate the errors introduced by staleness,

20: we devise an instance weighting mechanism that measures the reliability

21: of the estimated gradients.

22: Theoretical analysis proves that \system achieves a sub-linear convergence rate similar to that of

23: vanilla VFL training while requiring far fewer communication rounds.

24: Empirical results on both public and real-world workloads

25: validate that \system can be up to six times faster than existing approaches.

26: \end{abstract}

27: