\begin{abstract}
Krylov methods are a key approach to solving large sparse linear systems of equations,
but they suffer from poor strong scalability on distributed-memory
machines. This is due to high synchronization costs from large numbers of collective
communication calls alongside a low computational workload.
Enlarged Krylov methods address this issue by decreasing the
total number of iterations to convergence; they do so by splitting the initial
residual, which results in operations on block vectors.
In this paper, we present a performance study of an Enlarged
Krylov method, Enlarged Conjugate Gradients (ECG), noting the
impact of block vectors on parallel performance at scale.
Most notably, we observe increased point-to-point
communication overhead as a result of denser messages in the sparse matrix-block
vector multiplication kernel. Additionally, we present models
to analyze the expected performance of ECG and to
motivate design decisions. Most importantly, we introduce a new point-to-point
communication approach based on node-aware communication techniques that increases
the efficiency of the method at scale.
\end{abstract}