1: \begin{abstract}
2: We propose a block Krylov subspace version of the GCRO-DR method proposed in
3: [Parks et al. SISC 2005], which is an iterative method
4: allowing for the efficient minimization of the the residual over an augmented block
5: Krylov subspace. We offer a clean derivation of the method and discuss methods of
6: selecting recycling subspaces at restart as well as implementation decisions in the
7: context of high-performance computing. Numerical experiments are split into
8: those demonstrating convergence properties and those demonstrating the data movement
9: and cache efficiencies of the dominant operations of the method, measured using processor
10: monitoring code from Intel.
11: \end{abstract}
12: