1:
2:
3: \appendix A. Statistical error analysis
4:
5: In the physics analysis of the runs $A_1-A_3$, $B_1-B_4$ and
6: $D_1-D_5$, we kept track of the statistical errors using
7: the jackknife method. In particular,
8: any correlations among the errors of different observables
9: were always properly taken into account.
10: Here we summarize our conventions and briefly explain
11: the basic procedures that we used.
12:
13: \subsection A.1 Jackknife samples
14:
15: Let $A_{r}$, $r=1,\ldots,R$, be a set of
16: primary stochastic observables and
17: $a_{r,1},\ldots,a_{r,N}$ a sequence of $N$ measured values of
18: these. In lattice QCD the most common primary observables are the Wilson
19: loops and sums of products of quark propagators.
20: The jackknife method assumes that
21: the measured values are unbiased and statistically independent.
22: We shall thus take it for granted that the residual autocorrelations
23: are negligible in the cases of interest (see sect.~4).
24:
25: The averages $\abar_r$ of the observables $A_r$
26: and the associated statistical error covariance $C_{rs}$
27: are given by
28: \equation{
29: \abar_r={1\over N}\sum_{i=1}^Na_{r,i},
30: \enum
31: \next{2ex}
32: C_{rs}={1\over N(N-1)}\sum_{i=1}^N
33: \left(a_{r,i}-\abar_r\right)
34: \left(a_{s,i}-\abar_s\right).
35: \enum
36: }
37: If we introduce
38: the jackknife samples
39: \equation{
40: a^J_{r,i}=\abar_r+c_N\left(\abar_r-a_{r,i}\right),
41: \qquad
42: c_N=\left(N(N-1)\right)^{-1/2},
43: \enum
44: }
45: an equivalent expression for the error matrix is
46: \equation{
47: C_{rs}=\sum_{i=1}^N
48: \left(a^J_{r,i}-\abar_r\right)
49: \left(a^J_{s,i}-\abar_s\right).
50: \enum
51: }
52: Note that our definition of the jackknife
53: samples slightly departs from
54: the standard conventions, where $c_N=1/(N-1)$.
55: The modification is numerically insignificant in practice, but
56: leads to some simplifications
57: when data from different simulations
58: are to be combined (see subsect.~A.3).
59:
60:
61: \subsection A.2 Error propagation
62:
63: Apart from estimating the primary observables, one may be
64: interested in evaluating various functions $f(A_1,\ldots,A_R)$
65: of them,
66: which may involve fit procedures and
67: other complicated operations.
68: The standard stochastic estimate of such an observable is
69: \equation{
70: \fbar=f(\abar_1,\ldots,\abar_R)
71: \enum
72: }
73: and the associated series of jackknife estimates is defined by
74: \equation{
75: f^J_i=f(\abar^J_{1,i},\ldots,\abar^J_{R,i}),
76: \qquad i=1,\ldots,N.
77: \enum
78: }
79: A little algebra then shows that the expression
80: \equation{
81: \sigma^2=
82: \sum_{i=1}^N
83: \left(f^J_i-\fbar\right)^2
84: \enum
85: }
86: provides an estimate of the statistical variance of $\fbar$, which
87: coincides with the usual error propagation formula (the one
88: that involves the
89: gradient of $f$) up to terms of order $1/N$. Similarly the
90: error covariance of $f$ and any other function $g$ is obtained by
91: summing $(f^J_i-\fbar)(g^J_i-\gbar)$
92: over the jackknife samples.
93:
94: In practice the error formula (A.7) proves to be very convenient.
95: If an observable is a function of previously calculated
96: observables, for example, one can take advantage of the
97: fact that the composition of functions is associative, i.e.~the
98: jackknife series $f^J_i$ is simply obtained
99: by inserting the jackknife series of the arguments,
100: independently of whether these are primary or not.
101: The data analysis can thus proceed in steps, starting from the
102: primary observables and progressing to more and more complicated
103: observables.
104:
105:
106: \subsection A.3 Combining data from different runs
107:
108: Simulations of lattice QCD at different
109: sea-quark masses, lattice spacings, etc., can be assumed to
110: be statistically independent. The statistical variance of any observable
111: that depends on data from several simulations is therefore the sum of the
112: associated partial variances.
113: This rule can easily be accommodated in the jackknife analysis
114: by embedding the jackknife series of the observables in extended
115: series that include all simulations on which the
116: observable depends.
117:
118: The method is best explained by considering two
119: simulations, where $N_1$ measurements
120: of some observables $A_r$ are made in the first
121: and $N_2$ measurements of some other observables $B_s$
122: in the second. The associated jackknife
123: series $a^J_{r,1},\ldots,a^J_{r,N_1}$ and $b^J_{s,1},\ldots,b^J_{s,N_2}$
124: are then computed as before,
125: starting from the primary observables in each simulation.
126: Next they are embedded in extended series
127: \equation{
128: a^J_{r,1},\ldots,a^J_{r,N_1},
129: \underbrace{\abar_r,\ldots,\abar_r}_{N_2\;{\rm elements}}
130: \qquad\hbox{and}\qquad
131: \underbrace{\bbar_s,\ldots,\bbar_s}_{N_1\;{\rm elements}}
132: b^J_{s,1},\ldots,b^J_{s,N_2}
133: \enum
134: }
135: of length $N_1+N_2$
136: such that the first $N_1$ elements are occupied by the jackknife
137: series from the first simulation and the last $N_2$ elements by those from the
138: second simulation.
139:
140: With this assignment, and if the extended
141: series are treated as ordinary jackknife series,
142: the correct error correlation matrix of the full set
143: $A_1,\ldots,A_R,B_1,\ldots,B_S$
144: of observables is obtained.
145: Moreover, we may define the jackknife series of
146: any observable $f(A_1,\ldots A_R,B_1,\ldots,B_S)$ in the standard
147: manner and compute its variance using eq.~(A.7).
148: The embedding trick thus allows the statistical errors
149: to be pro\-pa\-gated as if there were a single simulation.
150:
151:
152: