8a38548fe093690d.tex
1: \begin{abstract}
2: Recent advances in graph processing on FPGAs promise to alleviate performance bottlenecks with irregular memory access patterns.
3: Such bottlenecks challenge performance for a growing number of important application areas like machine learning and data analytics.
4: While FPGAs denote a promising solution through flexible memory hierarchies and massive parallelism, we argue that current graph processing accelerators either use the off-chip memory bandwidth inefficiently or do not scale well across memory channels.
5: 
6: In this work, we propose GraphScale, a scalable graph processing framework for FPGAs.
7: For the first time, GraphScale combines multi-channel memory with asynchronous graph processing (\ie for fast convergence on results) and a compressed graph representation (\ie for efficient usage of memory bandwidth and reduced memory footprint).
8: GraphScale solves common graph problems like breadth-first search, PageRank, and weakly-connected components through modular user-defined functions, a novel two-dimensional partitioning scheme, and a high-performance two-level crossbar design.
9: \end{abstract}
10: