abstract:3644c03d3c79736c.tex

1: \begin{abstract}

2:

3:

4:

5: The message passing-based graph neural networks (GNNs) have achieved great success in many real-world applications.

6: However, training GNNs on large-scale graphs suffers from the well-known {\it neighbor explosion} problem, i.e., the exponentially increasing dependencies of nodes with the number of message passing layers.

7: Subgraph-wise sampling methods---a promising class of mini-batch training techniques---discard messages outside the mini-batches in backward passes to avoid the neighbor explosion problem at the expense of gradient estimation accuracy.

8: This poses significant challenges to their convergence analysis and convergence speeds, which seriously limits their reliable real-world applications.

9: To address this challenge, we propose a novel subgraph-wise sampling method with a convergence guarantee, namely \textbf{L}ocal \textbf{M}essage \textbf{C}ompensation (LMC).

10: To the best of our knowledge, LMC is the {\it first} subgraph-wise sampling method with provable convergence.

11: The key idea of LMC is to retrieve the discarded messages in backward passes based on a message passing formulation of backward passes.

12: By efficient and effective compensations for the discarded messages in both forward and backward passes, LMC computes accurate mini-batch gradients and thus accelerates convergence.

13: We further show that LMC converges to first-order stationary points of GNNs.

14: Experiments on large-scale benchmark tasks demonstrate that LMC significantly outperforms state-of-the-art subgraph-wise sampling methods in terms of efficiency.

15:

16:

17:

18: \end{abstract}

19: