abstract:bb40947ac229204b.tex

1: \begin{abstract}

2: Krylov methods are a key way of solving large sparse linear systems of equations,

3: but suffer from poor strong scalability on distributed-memory

4: machines. This is due to high synchronization costs from large numbers of collective

5: communication calls alongside a low computational workload.

6: Enlarged Krylov methods address this issue by decreasing the

7: total number of iterations to convergence, a consequence of splitting the initial

8: residual, which in turn results in operations on block vectors.

9: In this paper, we present a performance study of an enlarged

10: Krylov method, Enlarged Conjugate Gradients (ECG), noting the

11: impact of block vectors on parallel performance at scale.

12: Most notably, we observe the increased overhead of point-to-point

13: communication as a result of denser messages in the sparse matrix-block

14: vector multiplication kernel. Additionally, we present models

15: to analyze the expected performance of ECG and to

16: motivate design decisions. Most importantly, we introduce a new point-to-point

17: communication approach based on node-aware communication techniques that increases

18: efficiency of the method at scale.

19: \end{abstract}

20: