\begin{abstract}

A scalable semi-supervised node classification method on
graph-structured data, called GraphHop, is proposed in this work. The
graph contains attributes of all nodes but labels of only a few nodes. The
classical label propagation (LP) method and the emerging graph
convolutional network (GCN) are two popular semi-supervised solutions to
this problem. The LP method does not model node attributes and labels
jointly, and it converges slowly on large-scale graphs. GraphHop is
proposed to address these shortcomings. With proper
initial label vector embeddings, each iteration of GraphHop contains two
steps: 1) label aggregation and 2) label update. In Step 1, each node
aggregates its neighbors' label vectors obtained in the previous
iteration. In Step 2, a new label vector is predicted for each node
based on the label of the node itself and the aggregated label
information obtained in Step 1. This iterative procedure exploits the
neighborhood information and enables GraphHop to perform well at
extremely low label rates and to scale well to very large graphs.
Experimental results show that GraphHop outperforms state-of-the-art
graph learning methods on a wide range of tasks (e.g., multi-label and
multi-class classification on citation networks, social graphs, and
commodity consumption graphs) in graphs of various sizes. Our code is
publicly available on GitHub.\footnote{https://github.com/TianXieUSC/GraphHop}

\end{abstract}
