0710:0710.3612/mg.tex

1:

2: % Please use the skeleton file you have received in the

3: % invitation-to-submit email, where your data are already

4: % filled in. Otherwise please make sure you insert your

5: % data according to the instructions in PoSauthmanual.pdf

6: \documentclass{PoS}

7:

8: \def\mg{MG}

9: \def\rmsub#1#2{#1_{\mbox{\tiny #2}}} % Small roman subscript

10: \def\secspace{\vspace{-0.38mm}}

11:

12: \title{Adaptive Multigrid Algorithm for the QCD Dirac-Wilson Operator}

13:

14: \ShortTitle{Adaptive Multigrid Algorithm for the QCD Dirac-Wilson Operator}

15:

16: \author{J. Brannick$^a$, R. C. Brower$^{bc}$,

17:   \speaker{M. A. Clark}$^b$, J. C. Osborn$^{bd}$ and C. Rebbi$^{bc}$\\

18:   \llap{$^a$} Department of Mathematics, The Pennsylvania State

19:   University, 230 McAllister Building,

20:   University Park, PA 16802, USA\\

21:   \llap{$^b$} Center for Computational Science, Boston University, 3

22:   Cummington St,

23:   MA 02215, USA\\

24:   \llap{$^c$} Department of Physics, Boston University,

25:   590 Commonwealth Avenue, Boston,  MA 02215,  USA\\

26:   \llap{$^d$} Argonne Leadership Computing Facility, 9700 S. Cass

27:   Avenue, Argonne, IL 60439, USA}

28:

29:       \abstract{We present a new multigrid solver that is suitable

30:         for the Dirac operator in the presence of disordered gauge

31:         fields. The key behind the success of the algorithm is an

32:         adaptive projection onto the coarse grids that preserves the

33:         near null space. The resulting algorithm has weak dependence

34:         on the gauge coupling and exhibits mild critical slowing down in

35:         the chiral limit. Results are presented for the Wilson Dirac

36:         operator of the 2d U(1) Schwinger model.}

37:

38: \FullConference{The XXV International Symposium on Lattice Field Theory\\

39: 		 July 30-4 August 2007\\

40: 		 Regensburg, Germany}

41:

42: \begin{document}

43:

44: \section{Introduction}

45: \secspace

46: The most demanding computational task in lattice QCD simulations

47: consists of the calculation of quark propagators, which are needed

48: both for generating gauge field configurations with the appropriate

49: measure and for the evaluation of most observables.  This calculation

50: consists of solving a very large system of linear equations,

51: \begin{equation}

52: D(U) \psi = \chi,

53: \label{eq:Dpsichi}

54: \end{equation}

55: where $\psi$ is the quark propagator, $\chi$ is the source term and

56: $D(U)$ is the discretized the Dirac operator matrix, with elements

57: dependent on the gauge field background $U$.

58:

59: In the language of applied mathematics, Eq.~(\ref{eq:Dpsichi}) is a

60: discretized elliptic partial differential equation (PDE).  For

61: definiteness,

62: \begin{displaymath}

63:   D_{x,y} = - \frac{1}{2} \displaystyle \sum_{\mu=1}^{d} \bigl( (1-\gamma_\mu)U_x^\mu\,

64:   \delta_{x+\hat\mu,y}\, + (1+\gamma_\mu)U_{x-\hat\mu}^{\mu \dagger}\, \delta_{x-\hat\mu,y}\bigr) +

65:   (2d + m)\delta_{x,y}

66: \end{displaymath}

67: is the discretized Dirac operator describing a fermion in \(d\)

68: dimensions with mass \(m\) in the Wilson discretization of the Dirac

69: equation.  In the full 4 dimensional QCD problem (in volume \(V\)) the

70: matrices \(\gamma_\mu\) are the \(4\times4\) Dirac spin matrices and

71: \(U\) is the \(SU(3)\) gauge field.  It is this formulation that we

72: concentrate upon, however, we point out that many of the problems

73: encountered in solving this equation extend to other

74: formulations.

75:

76: For any realistic QCD calculation the size of the matrix in

77: Eq.~(\ref{eq:Dpsichi}) is too large for a direct solver and iterative

78: Krylov-space methods must be used.  As the quark mass is brought

79: towards zero, the condition number of the matrix diverges and hence so

80: does the number of iterations until the desired convergence.  This

81: scaling with the mass is commonly referred to as critical slowing

82: down.

83:

84: It has been known for some time that the multigrid (\mg) approach is

85: optimal when solving systems of the form \(Ax=b\), where \(A\) is the

86: sparse matrix that arises from the discretization of continuum

87: differential equations, \(b\) is a source vector and \(x\) is the

88: desired solution vector.  Here discretizations on successively coarser

89: (blocked) grids are used to accelerate the solver and this approach is

90: known to remove critical slowing down~\cite{Brandt:1977}.

91:

92: One exception to the above statement is in solving the Dirac operator

93: in lattice QCD: here the nature of the underlying gauge field in the

94: Dirac operator has proven to be especially resistant to \mg.  Previous

95: attempts at \mg\ solvers have relied on renormalization group

96: arguments to define the coarse grids without realizing why the \mg\

97: approach succeeds, and this has invariably led to failure as the

98: physically interesting regime is approached

99: ~\cite{Brower:1991xv,Lauwers:1992cp}~\footnote{We note, however, that

100:   recent progress has been made in the use of renormalization group to

101:   define a coarse Dirac operator, which may render this statement

102:   erroneous~\cite{Borici:2007ft}.}.  In this work we demonstrate an

103: \mg\ algorithm for the Dirac operator normal equations, i.e., the

104: positive definite operator given by

105: \[

106: A= D^\dagger D,

107: \]

108: that is shown to work in all regimes and vastly reduces the notorious

109: critical slowing of the solver as the renormalized fermion mass is

110: brought to zero.  We do so in the context of a 2-dimensional system

111: with $U(1)$ gauge field (Schwinger model).  This system captures many

112: of the physical properties (confinement, chiral symmetry breaking,

113: existence of non-trivial topological sectors) of the more complex

114: 4-dimensional QCD.

115:

116: \secspace

117: \section{Multigrid}

118: \secspace

119: The original formulation of \mg\ is best viewed with the example of

120: the free Dirac operator.  Multigrid solvers are based on the

121: observation that stationary iterative solvers (e.g., Jacobi,

122: Gauss-Seidel) are only effective at reducing local error components

123: leaving slow to converge, low wave-number components in the error.

124: For the free Dirac operator these slow modes will be geometrically

125: smooth and can be accurately represented on a coarser grid using

126: simple linear averaging.  However, on the coarse grid these low

127: wave-number error components become modes of shorter range and so

128: relaxation should be effective at removing them.  This process can

129: continue, moving to coarser and coarser grids until the degrees of

130: freedom have been thinned enough to solve the system exactly.  The

131: solution is then promoted back to the finest grid using linear

132: interpolation, where at each level relaxation is applied to the

133: correction vector to remove any high wave-number error components that

134: were introduced.  This process is known as a V-cycle~\cite{Brandt:1977}

135: and can be used as a solver in its own right, or more effectively as a

136: preconditioner for a Krylov method e.g., conjugate gradients (CG).

137:

138: Before continuing we introduce the notation where the degree of

139: coarseness is represented by the integer \(l\), where \(l=1\)

140: represents the finest grid (i.e., where our actual problem is defined)

141: and \(l=L\) is the coarsest grid in an \(L\)-level \mg\ algorithm.

142: The operator used to promote a coarse grid vector on grid \(l+1\) to

143: the adjacent fine grid \(l\) is known as the prolongator

144: \(P^{(l,l+1)}\), and the converse operator is the restriction operator

145: \(Q^{(l+1,l)}\) which projects a fine grid vector onto the adjacent

146: coarse grid.  Typically the Galerkin definition is used to define the

147: coarse grid operator~\cite{Brandt:1977},

148: \begin{equation}

149:   A^{(l+1)} = Q^{(l+1,l)} A^{(l)} P^{(l,l+1)} = P^{(l,l+1)\dagger} A^{(l)} P^{(l,l+1)},

150: \label{eq:Galerkin}

151: \end{equation}

152: where we have defined the restriction operator as \(Q=P^\dagger\).

153: This guarantees the coarse grid operator is Hermitian positive

154: definite.  That the Galerkin definition is the optimum definition for

155: \(A\) can be found by minimizing the error of the coarse grid

156: corrected solution vector in the \(A\)-norm.  Apart from the coarsest

157: level which is an exact solve, each level of the V-cycle is the

158: following

159: \begin{enumerate}

160: \item Relax on the input vector, \(x^{(l)} = R^{(l)\dagger} b^{(l)}\),

161:   where \(R^{(l)\dagger}\) is a suitable relaxation

162:   operator.\footnote{The relaxation operator need not be Hermitian for

163:     the entire V-cycle to be Hermitian: the post-relaxation operator

164:     need only be the Hermitian conjugate to pre-relaxation.}

165: \item Restrict the resulting residual to the next coarsest grid,

166:   \(r^{(l+1)} = P^{(l,l+1)\dagger} (b^{(l)}-A^{(l)}x^{(l)})\).

167: \item Apply the \(L=l+1\) V-cycle on the coarse residual,

168:   \(e^{(l+1)} = V^{(l+1)} r^{(l+1)}\).

169: \item Correct the current solution with coarse grid correction,

170:   \(x^{(l)} = x^{(l)} + P^{(l,l+1)} e^{(l+1)}\).

171: \item Relax on the final residual, \( x^{(l)} = R^{(l)} (b^{(l)} - A^{(l)}x^{(l)})\).

172: \end{enumerate}

173: Written explicitly in terms of operators the \(l^{th}\) level of the

174: V-cycle thus takes the following form

175: \begin{equation}

176:   V^{(l)} = R^{(l)} + R^{(l)\dagger} + R^{(l)}A^{(l)}R^{(l)\dagger} +

177:   \Big[ (1 - R^{(l)}A^{(l)}) P^{(l,l+1)} V^{(l+1)}

178:   P^{(l,l+1)\dagger} (1 - A^{(l)} R^{(l)\dagger}) \Big].

179: \label{eq:vcycle}

180: \end{equation}

181: In this form the Hermiticity of the V-cycle is obvious (a necessary

182: condition if we are to use the V-cycle as a CG preconditioner).  The

183: cost of applying the \mg\ V-cycle becomes apparent from this explicit

184: form: on each level we must apply the operator \(A^{(l)}\) a total of

185: \(2\nu + 2\) times for each \(l\), where \(\nu\) is the number of

186: steps within the relaxation operator.

187:

188: The problem in the application of the above procedure in the presence

189: of a non-trivial gauge field is that the eigenvectors responsible for

190: slow convergence are no longer low wave-number modes with

191: geometrically smooth variation.  They are instead modes that exhibit

192: localized lumps, typically extending over several lattice spacings.

193: An approach that was followed in \cite{Brower:1991xv} was to impose

194: Dirichlet boundary conditions along the block boundaries, and to use

195: the low modes of resulting blocked operator to define the prolongator.

196: This approach is bound to produce only a limited advantage since the

197: lumps of the low modes can span between several such blocks, so the

198: blocked operator will not possess this property.  Indeed in

199: \cite{Brower:1991xv} some acceleration was obtained but critical

200: slowing down was found to return after the correlation length of the

201: pion \(\mu^{-1}\) exceeded the correlation length \(l_\sigma\) of the

202: underlying gauge field.

203:

204: \secspace

205: \section{Adaptive Multigrid}

206: \label{sec:adaptive}

207: \secspace

208: A breakthrough in the application of multiscale methods to more

209: complex problems occurred with the discovery of adaptive \mg\

210: techniques~\cite{Brezina:2004, Brannick:2006}.  In the adaptive

211: algorithm the \mg\ process itself defines the appropriate prolongator

212: by an iterative procedure which we now concisely describe.

213:

214: In the first pass, one uses relaxation alone to solve the homogenous

215: problem \(Ae=0\) with a randomly chosen initial error vector.  After a

216: certain number, \(\nu\), of relaxation steps, the relaxation procedure, which we

217: symbolically represent by

218: \begin{equation}

219: e \to e^\prime=(I-\omega A)^\nu e \equiv (I- \omega D^\dagger D)^\nu e,

220: \label{eq:relax}

221: \end{equation}

222: produces an \(e^\prime\) that essentially belongs to the space spanned

223: by the slow modes, so $e^\prime$ is now used to define a first

224: approximation to the prolongator $P$.  One blocks the variables of the

225: original lattice into subsets, which we denote by $S_j$.  From

226: $e^\prime$ we construct the vectors $e^\prime_j$, which are identical

227: to $e^\prime$ within $S_j$ and $0$ outside $S_j$, and the vectors of

228: unit norm $v_{1j}=e'_j /\vert e'_j \vert$.  The extra ``1'' index in

229: $v_{1j}$ has been introduced for a discussion that follows.  The

230: prolongator $P^{(1,2)} \equiv P^{(1,2)}_{i,j}$ which maps a vector

231: $\psi_j^{(2)}$ in the coarse lattice, indexed by $j$, to the original

232: lattice, where $i$ denotes collectively the site, spin and possible

233: internal symmetry indices, is then defined by

234: \begin{equation}

235: P^{(1,2)}_{i,j}=v_{1j,i},

236: \label{eq:P1}

237: \end{equation}

238: where we have made explicit the fine lattice indices of $v_{1j}$.

239:

240: There are variations on how to block the fine lattice, i.e.,~how to

241: define the sets $S_j$.  In the so called ``algebraic adaptive MG'' one

242: partitions the fine lattice into subsets on the basis of the magnitude

243: of the matrix elements of $A$.  Since such matrix elements in lattice

244: gauge theories are typically of uniform magnitude, differing rather in

245: phase or, in a broader sense, in orientation within the space of gauge

246: transformations, we chose instead to partition the lattice

247: geometrically into fixed blocks of neighboring lattice sites,

248: specifically $4\times 4$ squares in our study of the Schwinger model.

249: Maintaining a regular lattice on coarse levels will allow more

250: efficient parallel code with exact load balancing.

251:

252: Another refinement of the technique consists of applying a simple

253: Richardson iteration to the vectors $v_{1j}$ before defining the

254: prolongator.  The choice of damping parameter in this smoothing

255: procedure is chosen to minimize the condition number of the resulting

256: coarse grid operator.  The term ``smoothed aggregation'' is used for

257: this.  Thus our overall technique can be referred to as ``geometric

258: adaptive smoothly aggregated MG''.

259:

260: We come now to the crux of the adaptive \mg\ method.  We use the

261: prolongator defined above (Eq.~(\ref{eq:P1})) to implement a standard

262: \mg\ V-cycle and apply it, like relaxation before, to a randomly

263: chosen error vector.  There are two possibilities.  Either the V-cycle

264: reduces the error with no sign of critical slowing down or some large

265: error, $e^{\prime \prime}$, survives the cycle.  In the first case, of

266: course, one need not proceed: the \mg\ procedure works as is.  In the

267: second case, we define another set of vectors $v_{2j}$ over the coarse

268: lattice by restricting $e^{\prime \prime}$ to the subsets $S_j$,

269: making the new vectors orthogonal to the vectors $v_{1j}$ and

270: normalizing them to 1.  The smoothed aggregation procedure is now

271: applied to the set $v_{sj}\equiv (v_{1j}, v_{2j})$.  A new prolongator

272: is defined by projecting over these vectors

273: \begin{displaymath}

274: P_{i,sj}^{(1,2)}=v_{sj,i},

275: \end{displaymath}

276: where the index $s$ (taking values $1,2$) can be considered

277: as an intrinsic index over the coarse lattice.

278:

279: The procedure described in the above paragraph is repeated as

280: necessary, until the repeated application of a V-cycle reduces a

281: random initial error sufficiently without critical slowing down.  The

282: method works if critical slowing down is eliminated with a few

283: iterations of the adaptive procedure.  If this occurs with $M$ vector

284: sets, then the coarse lattice will carry $M$ degrees of freedom per

285: site.  As with all \mg\ methods, the procedure is recursive and it can

286: be used to define further coarsenings.

287:

288: \secspace

289: \section{Results}

290: \secspace

291: In testing this algorithm for lattice QCD we generated quenched $U(1)$

292: gauge field configurations on a $128 \times 128$ lattice with the

293: standard Wilson gauge field action

294: \begin{displaymath}

295: S=\sum_{x,\nu<\mu} \beta\, {\rm Re}\, U_x^{\mu \nu} \equiv

296: \sum_{x,\nu<\mu} \beta\, {\rm Re}\, U_x^{\mu} U_{x+\hat \mu}^{\nu}

297: U_{x+\hat \nu}^{\mu \dagger} U_x^{\nu \dagger}

298: \end{displaymath}

299: and periodic boundary conditions at $\beta=6$ and $\beta=10$.  These

300: two values of $\beta$ define correlation lengths for the gauge field

301: to be $l_\sigma = 3.30 $ and $l_\sigma = 4.35$ respectively, via the

302: area law for the Wilson loop: $ W \sim \exp[ -A/l^2_\sigma]$. For

303: comparison on these lattices, a fermion mass gap \(\hat{m} = m -

304: \rmsub{m}{crit} = 0.01\) corresponds to the pseudoscalar meson

305: correlation lengths \(\mu^{-1} = 6.4 \) and \(\mu^{-1} = 12.7 \)

306: respectively.~\footnote{All quantities are expressed in lattice

307:   units.}  In the 2-dimensional \(U(1)\) gauge theory, one can

308: identify a gauge invariant topological charge \(\hat{Q}\), which in the

309: continuum limit is proportional to the quantized magnetic flux flowing

310: through the system.  A gauge field with nonzero \(\hat{Q}\) corresponds to a

311: Dirac operator with exactly real eigenvalues and, hence, as the mass

312: gap is brought towards zero the condition number becomes infinite.

313: Thus, it is important to test both trivial (\(\hat{Q}=0\)) and non-trivial

314: (\(\hat{Q}\neq 0\)) gauge field topologies.

315:

316: We blocked the lattice into $4 \times 4$ blocks and implemented the

317: adaptive \mg\ procedure described above.  We used a degree 2

318: polynomial smoother for our relaxation procedure, where the

319: coefficients were chosen by running two iterations of an underrelaxed

320: minimum residual solver (\(\omega=0.8\)) and subsequently held fixed

321: (hence, for our choice of smoother \(R=R^\dagger\)).  The coarsening

322: procedure was repeated twice maintaining $M=8$ vectors in all

323: coarsenings, down to an $8 \times 8$ lattice, over which the equations

324: were solved exactly.  For each gauge field we performed the set up

325: procedure for the \mg\ preconditioner for the lightest mass parameter

326: only, and reused the vectors for each heavier mass. We used this

327: constructed V-cycle as a preconditioner for CG where the operator

328: defined in Eq.~(\ref{eq:vcycle}) is applied at each iteration to the CG

329: direction vector (here on referred to as MG-CG).

330:

331: \begin{figure}

332:   \hfill

333:   \begin{minipage}[t]{0.47\textwidth}

334:     \includegraphics[width=1.05\textwidth]{beta6.eps}

335:     \caption{Number of Dirac operator applications of CG vs.~MG-CG as

336:       function of the mass gap at $\beta =6$ (point source, relative

337:       residual \(|r|=10^{-14}\)).}

338:     \label{fig:beta6}

339:   \end{minipage}

340:   \hfill

341:   \begin{minipage}[t]{0.47\textwidth}

342:     \includegraphics[width=1.05\textwidth]{beta10.eps}

343:     \caption{Number of Dirac operator applications of CG vs.~MG-CG as

344:       function of the mass gap at $\beta =10$ (point source, relative

345:       residual \(|r|=10^{-14}\)).}

346:     \label{fig:beta10}

347:   \end{minipage}

348:   \hfill

349: \end{figure}

350:

351: If one compares the number of CG iterations needed to achieve

352: convergence with or without \mg\ preconditioning, the gain obtained

353: with the \mg\ method is dramatic: for example, with \(\beta =6\),

354: \(\hat{m}=0.01\) and \(\hat{Q}=0\), it takes 3808 iterations to

355: achieve convergence with a straightforward application of the CG

356: technique, whereas it takes only 26 iterations using MG-CG.  However

357: this comparison does not take into account the fact that many more

358: operations per iterations must be performed when applying the \mg\

359: preconditioner.  To achieve a more balanced comparison, in

360: Figs.~\ref{fig:beta6},~\ref{fig:beta10} we plot the total number of

361: applications of \(D\) and \(D^\dagger\) done on the fine lattice.

362: This reflects better the actual cost of the calculations (at each

363: iteration of MG-CG there are 6 applications of \(D^\dagger D\): 1

364: application in the outer CG, and 2 pre- and 2 post- coarsening

365: smoothing applications and 1 further application required to form the

366: residual).  We do not include the additional cost arising from the

367: coarse lattices since this is expected to be a small overhead, and has

368: not been optimized for our model calculation.  The advantage coming

369: from the use of the adaptive \mg\ technique is still very dramatic:

370: critical slowing down, if not totally eliminated, is very

371: substantially reduced and there is no slow down as as the pion

372: correlation length exceeds the gauge field correlation length.  These

373: results are for point sources, however, we tried a variety of

374: different source vectors for this analysis (e.g., Gaussian noise,

375: \(Z_4\) noise) and found very little dependence of MG-CG performance

376: on the source vector.

377:

378: From the point of view of computational complexity, one should also

379: take into account the cost of setting up the \mg\ preconditioner,

380: i.e.,~of constructing the prolongator $P$.  This cost is however

381: heavily amortized, to the point of being negligible, if, as is often

382: the case, one must apply the solver to systems with multiple given

383: vectors (for example, solving for all color and spin components of a

384: quark propagator or, in the calculation of disconnected diagrams where,

385: \(O(1000)\) inverses are required to estimate the trace of the inverse

386: Dirac operator).

387:

388: \secspace

389: \section{Conclusion}

390: \secspace

391: Our results, albeit for now limited to a 2-dimensional example,

392: provide a clear indication that adaptive \mg\ can be made to work with

393: the lattice Dirac operator.  What appears to be at the root of its

394: success is that, although the modes responsible for slow convergence

395: of the Dirac solver on a fine lattice are not low wavenumber

396: excitations, like in the free case, their span can be well

397: approximated by a set of vectors of limited dimensionality on the

398: blocks that define the coarse lattice.\footnote{The observation that

399:   the space of slow modes may be of limited span is also at the root

400:   of a method recently proposed by L\"uscher in

401:   Ref.~\cite{Luscher:2007se}, although the technique there is quite

402:   different from the one we follow.}  Earlier

403: attempts~\cite{Brower:1991xv,Lauwers:1992cp} failed to eliminate

404: critical slowing down when the pseudoscalar length exceeded the

405: disorder length of the gauge field: $\mu^{-1} > l_\sigma$.  Adaptive

406: \mg\ finds the coarse subspaces through the iterative application of

407: the method itself.  It is of course crucial that the approximation to

408: the space of slow modes can be achieved with a small number of vectors

409: on the individual blocks, otherwise the application of the method

410: would not be cost effective.  But this appears to be the case in the

411: examples we studied and, if the results hold true in general, adaptive

412: \mg\ has the potential of substantially speeding up lattice QCD

413: simulations as the increase of available computational power leads one

414: to consider ever larger lattices.

415:

416: Current research is focussed on applying this adaptive \mg\ algorithm

417: to the Dirac operator directly, as opposed to the normal equations.

418: Here we are motivated to do so because of the reduced condition number

419: and the increased sparsity of the operator.  There are added

420: complications because the Dirac operator isn't Hermitian (this

421: requires that the restriction and prolongation operators are defined

422: using left and right vectors respectively), however, initial progress

423: is extremely promising.  The application of the method to large

424: 4-dimensional systems is in progress,

425:

426: Acknowledgments.  This research was supported in part under NSF grant

427: PHY-0427646 and DOE grants DE-FG02-91ER40676 and DE-FC02-06ER41440.

428:

429: \begin{thebibliography}{99}

430:

431: %\bibitem{Kaplan:1992bt}

432: %D.B.~Kaplan.

433: %\newblock A {M}ethod for simulating chiral fermions on the lattice.

434: %\newblock {\em Phys. Lett.}, B288:342--347, 1992.

435:

436: %\bibitem{Shamir:1993zy}

437: %Y.~Shamir.

438: %\newblock Chiral fermions from lattice boundaries.

439: %\newblock {\em Nucl. Phys.}, B406:90--106, 1993.

440:

441: %\bibitem{Neuberger:1998fp}

442: %H.~Neuberger.

443: %\newblock Exactly massless quarks on the lattice.

444: %\newblock {\em Phys. Lett.}, B417:141--144, 1998.

445:

446: \bibitem{Brandt:1977}

447: A.~Brandt.

448: %\newblock Multi-level adaptive solutions to boundary value problems.

449: \newblock {\em Math. Comp.}, 31:333--390, 1977.

450:

451: \bibitem{Brower:1991xv}

452: Richard~C. Brower, Robert~G. Edwards, Claudio Rebbi, and Ettore Vicari.

453: %\newblock Projective multigrid for {W}ilson fermions.

454: \newblock {\em Nucl. Phys.}, B366:689--705, 1991.

455:

456: \bibitem{Lauwers:1992cp}

457: P.~G. Lauwers and T.~Wittlich.

458: %\newblock Inversion of the fermion matrix in lattice {QCD} by means of parallel

459: %  transported multigrid {(PTMG)}.

460: \newblock {\em Int. J. Mod. Phys.}, C4:609--620, 1993.

461:

462: \bibitem{Borici:2007ft}

463:   A.~Borici,

464:   %``On Gauged Renormalisation Group Transformations of Lattice Fermions,''

465:   2007 (arXiv:0704.2341 [hep-lat]).

466:   %%CITATION = ARXIV:0704.2341;%%

467:

468: \bibitem{Brezina:2004}

469: M.~Brezina, R.~Falgout, S.~MacLachlan, T.~Manteuffek, S.~McCormick, and

470:   J.~Ruge.

471: %\newblock Adaptive smoothed aggregation ($\alpha${SA}).

472: \newblock {\em Siam J. Sci. Comput.}, 25:1896--1920, 2004.

473:

474: \bibitem{Brannick:2006}

475:   J.~Brannick, M.~Brezina, D.~Keyes, O.~Livne, I.~Livshits,

476:   S.~MacLachlan, T.~Manteuffel, S.~McCormick, J.~Ruge, and L.~Zikatanov.

477: %\newblock Adaptive smoothed aggregation in lattice qcd.

478: %\newblock In Olof~B. Widlund and David~E. Keyes, editors,

479: %  \newblock{\em Domain Decomposition Methods in Science and

480: %    Engineering XVI}, volume~55 of

481:   \newblock{\em Lecture Notes in Computational

482:     Science and Engineering}, 55:499--506, 2006.

483:

484: \bibitem{Luscher:2007se}

485: M.~L\"{u}scher.

486: %\newblock Local coherence and deflation of the low quark modes in lattice

487: %  {QCD}.

488: \newblock 2007 (arXiv:0706.2298 [hep-lat]).

489:

490: \end{thebibliography}

491:

492: \end{document}

493:

494:

495: