\begin{abstract}
Core decomposition is a well-established graph mining problem with many applications; it partitions a graph into a hierarchy of subgraphs. From the perspective of vertex convergence dependency, existing solutions follow either a bottom-up or a top-down approach. However, existing algorithms do not effectively exploit GPUs to accelerate core decomposition, despite the growing demand for higher performance. Moreover, pushing core decomposition toward its performance limits from both directions within a parallel synchronization structure has not been thoroughly explored.
This paper introduces PICO, an efficient GPU acceleration framework for the \emph{Peel} and \emph{Index2core} paradigms of $k$-core decomposition. We propose \emph{PeelOne}, a \emph{Peel}-based algorithm that simplifies the parallel logic and minimizes atomic operations by eliminating vertices that are `under-core'. We also propose \emph{HistoCore}, an \emph{Index2core}-based algorithm that addresses the extensive redundant computation over both vertices and edges. Extensive experiments on an NVIDIA RTX 3090 GPU show that \emph{PeelOne} outperforms all other \emph{Peel}-based algorithms and that \emph{HistoCore} outperforms all other \emph{Index2core}-based algorithms. Furthermore, \emph{HistoCore} achieves a $1.1\times$--$3.2\times$ speedup over \emph{PeelOne} on six datasets, which challenges the common belief that the \emph{Index2core} paradigm performs much worse than the \emph{Peel} paradigm in a shared-memory parallel setting.
\end{abstract}