0801:0801.4525/ms.tex

1: \documentclass[aps,showkeys,groupedaddress,twocolumn,showpacs]{revtex4/revtex4}

2: \usepackage{graphicx}

3: \usepackage[ansinew]{inputenc}

4: \usepackage[tbtags]{amsmath}

5: \usepackage{amssymb}

6: \usepackage{epsfig}

7: \newcommand{\erfc}{\mbox{erfc}}

8: \newcommand{\erf}{\mbox{erf}}

9: \newcommand{\joint}{$\rho^{\Theta}_j(x_{n},x_{n+1},d)$ }

10: \newcommand{\ja}{$\rho^{\Theta}_j(x_{n+1},x_n,a,eta)$ }

11: \newcommand{\marg}{$\rho^{\Theta}_m(x_n,d)$ }

12: \newcommand{\ma}{$\rho^{\Theta}_m(x_n,a,\eta)$ }

13: \newcommand{\ptotal}{$\rho^{\Theta}(a,\eta)$ }

14: \newcommand{\antima}{$\bar{\rho}^{\Theta}_m(x_n,a,\eta)$ }

15: \begin{document}

16: \title{When are Extreme Events the better predictable, the larger they are?}

17: \author{S. Hallerberg, H. Kantz}

18: \affiliation{

19: Max Planck Institute for the Physics of Complex Systems\\

20: N\"othnitzer Str.\ 38, D 01187 Dresden, Germany\\

21: }

22:

23: \date{\today}

24: \begin{abstract}

25: We investigate the predictability of extreme events in time series.  The focus of this work is to understand under which circumstances large events are better predictable than smaller events.

26: %

27: Therefore we use a simple prediction algorithm based on precursory structures which are identified using the maximum likelihood principle.

28: %

29: Using the receiver operator characteristic curve as a measure for the quality

30: of predictions we find that the dependence on the event magnitude is closely linked

31: to the probability distribution function of the underlying stochastic

32: process.

33: %

34: We evaluate this dependence on the probability distribution function analytically

35: and numerically.

36: %

37: If we assume that the optimal precursory structures are used to make the

38: predictions, we find that large increments are better predictable if the

39: underlying stochastic process has a Gaussian probability distribution

40: function, whereas larger increments are harder to predict if the underlying

41: probability distribution function has a power law tail.

42: %

43: In the case of an

44: exponential distribution function we find no significant dependence on the event magnitude.

45: %

46: Furthermore we compare these results with predictions of increments in

47: correlated data, namely, velocity increments of a free jet flow. The velocity

48: increments in the free jet flow are in dependence on the time scale

49: either asymptotically Gaussian or asymptotically exponential distributed.

50: The numerical

51: results for predictions within free jet data are in good agreement with the previous analytical considerations for random numbers.

52: \end{abstract}

53: \pacs{02.50.-r,05.45.Tp}

54: \keywords{time series analysis, extreme events, extreme increments,

55:   statistical inference, likelihood ratio}

56: \maketitle

57: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

58: \section{Introduction}

59: Systems with a complex time evolution, which generate

60: a great impact event from time to time, are ubiquitous.

61: %

62: Examples include

63: fluctuations of prices for financial assets in economy with rare market

64: crashes, electrical activity of human brain with rare epileptic seizures,

65: seismic activity of the earth with rare earthquakes, changing weather

66: conditions with rare disastrous storms, and also fluctuations of online

67: diagnostics of technical machinery and networks with rare breakdowns or

68: blackouts.

69: %

70: Due to the complexity of the systems mentioned, a complete

71: modeling is usually impossible, either due to the huge number of degrees of freedom involved, or due to a lack of precise knowledge about the governing

72: equations.

73: %

74: This is why one applies the framework of prediction via precursory structures

75: for such cases. The typical application for prediction with precursory

76: structures is a prediction of an event which occurs in the very near future,

77: i.e., on short timescales compared to the lifetime of the system.

78: %

79: A classical example for the search for precursory structures is the prediction

80: of earthquakes \cite{Jackson}.

81: %

82: A more recently studied example is the short

83: term prediction of strong turbulent wind gusts, which can  destroy wind

84: turbines \cite{Physa, Euromech}.

85:

86: In a previous work \cite{Sarah2}, we studied the quality of predictions

87: analytically via precursory structures for increments in an AR(1) process and

88: numerically in a long-range correlated ARMA process.

89: %

90: The long-range

91: correlations did not alter the general findings for Gaussian processes,

92: namely, that larger events are better predictable.

93:

94: Furthermore we found other works which report the same effect for earthquake

95: prediction \cite{Fatehme}, prediction of avalances in SOC-models

96: \cite{Sandpile} and in multiagent games \cite{Johnson1}.

97: %

98: In this contribution, we investigate the influence of the probability

99: distribution function (PDF) of the noise

100: term in detail by using not only Gaussian, but also exponential and power-law

101: distributed noise.

102: %

103: This approach is also motivated by the book of Egans \cite{Egans}

104: which explains that receiver operator characteristics (ROC) obtained in signal detection problems can be

105: ordered families of functions in dependence on a parameter.

106: %

107: We are now interested in learning how the behavior of these families of functions depends on the event magnitude and the distribution of the stochastic process,

108: if the ROC curve is used for evaluating the quality of predictions.\\

109: %

110: After defining the prediction scheme in Sec.\ \ref{pre} and the method for

111: measuring the quality of a prediction in Sec.\ \ref{roc},

112: %

113:  we explain in Sec.\ \ref{evsize} how to consider the influence on the event magnitude.

114: %

115: In Sec.\ \ref{constr} we formulate a constraint, which has to be fulfilled in order to find a better predictability of larger (smaller) events.

116: %

117: In the next section, we apply this constraint to compare the quality of

118: predictions of large increments within Gaussian (Sec.\ \ref{Gaussian}), exponential distributed

119: (Sec.\ \ref{symexpo}) and power-law distributed i.i.d.\ random numbers (Sec.\ \ref{powl}).

120: %

121: We study the prediction of increments in free jet data in Sec.\ \ref{freejet}.

122: %

123: Conclusions appear in Sec.\ \ref{conclusions}.

124: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

125: \section{Definitions and setup\label{sec2}}

126: The considerations in this section are made for a time series \cite{Box-Jen,Brockwell}, i.e.\ , a set of measurements $x_n$ at discrete

127: times $t_n$, where $t_n = t_0 + n\Delta$ with a sampling

128: interval $\Delta$ and $ n \in \mathbb{N}$.

129: %

130: The recording should contain sufficiently many extreme events so that we are able to extract statistical

131: information about them.

132: %

133: We also assume that the event of

134: interest can be identified on the basis of the observations, e.g.\, by the

135: value of the observation function exceeding some threshold, by a sudden

136: increase, or by its variance exceeding some threshold.

137: %

138:  We express the presence (absence) of an event by using a binary variable $Y_{n+1}$.

139: %

140: \begin{eqnarray}

141: Y_{n+1} & = &\left\{  \begin{array}{ll}

142: 1 \quad& \mbox{an event occurred at time} \\

143: & n+1 \\

144:  0 \quad& \mbox{no event occurred at time} \\

145: & n+1\end{array} \right.

146: \end{eqnarray}

147: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

148: \subsection{The choice of the precursor \label{pre}}

149: When we consider prediction via precursory structures ({\sl precursors},

150: or {\sl predictors}), we are typically in a situation where we assume that the

151: dynamics of the system under study has both, a deterministic and a stochastic

152: part.

153: %

154: The deterministic part allows one to assume that there is a relation between

155: the event and its precursory structure which we can use for predictive

156: purposes.

157: %

158: However, if the dynamic of the system was fully deterministic

159: there would be no need to predict via precursory structures, but we could

160: exploit our knowledge about the dynamical system as it is done, e.g., in weather forecasting.

161:

162: In this contribution we focus on the influence of the stochastic part of the

163: dynamics and assume therefore a very simple deterministic correlation between

164: event and precursor.%

165: The presence of this stochastic part determines that we cannot expect the precursor to preced {\em every} individual event.

166: %

167: That is why we define a precursor in this context as a data structure which is {\em typically} preceding an event, allowing deviations from the given structure, but also allowing events without

168: preceeding structure.

169:

170: For reasons of simplicity the following considerations are made for precursors

171: in real space, i.e., structures in the time series.

172: %

173: However, there is no reason not to apply the same ideas for precursory structures, which live in phase space.

174:

175: In order to predict an event $Y_{n+1}$ occurring at the time $(n+1)$ we compare the last $k$ observations, to which we will refer as the {\sl precursory variable}

176: %

177: \begin{equation}

178: {\mathbf x}_{(n-k+1,n)}= (x_{n-k+1},x_{n-k+2}, ..., x_{n-1},x_n)

179: \end{equation}

180: with a specific precursory structure

181: \begin{equation}

182: {\mathbf x}^{pre}= (x_{n-k+1}^{pre}, x_{n-k+2}^{pre}, ...,x_{n-1}^{pre}, x_n^{pre}).

183: \end{equation}

184: %

185: Once the precursory structure  ${\mathbf x}_{pre}$ is determined, we give an

186: alarm for an event $Y_{n+1}=1$ when we find the precursory variable

187: ${\mathbf x}_{(n-k+1,n)}$ inside the volume

188: %

189: %\begin{widetext}

190: \begin{eqnarray}

191: V^{pre}(\delta,{\mathbf x}^{pre}) & = & \prod_{j=n-k+1}^{n} \left(x_j^{pre}-\frac{\delta}{2},x_j^{pre}+\frac{\delta}{2}\right). \label{vol}

192: \end{eqnarray}

193: %\end{widetext}

194: %

195: There are different strategies to identify suitable precursory structures.

196: %

197: We choose the precursor via maximizing a conditional probability which we refer to as the {\sl likelihood} \cite{likelihood}.

198: %

199: \footnote{In this contribution we use the name likelihood for the probability that an event follows a precursor ${\mathbf x}$.

200: %

201: And the term {\sl aposterior pdf} for the probability to find a precursor ${\mathbf x}$ before of an already observed extreme event.

202: %

203: Note that the names might be also used vice versa,  if one refers to the precursor as the previously observed information.}

204: %

205: The likelihood

206: \begin{eqnarray}

207: L(Y_{n+1}=1|{\mathbf x}_{(n-k+1,n)}) = \frac{j(Y_{n+1}=1,{\mathbf

208:     x}_{(n-k+1,n)})}{\rho({\mathbf x}_{(n-k+1,n)})} \label{likely}

209: \end{eqnarray}

210:  provides the probability that an event $Y_{n+1}=1$ follows the precursor ${\mathbf x}_{(n-k+1,n)}$.

211: %

212:  It can be

213: calculated numerically by using the joint PDF $j((Y_{n+1}=1),{\mathbf

214:   x}_{(n-k+1,n)})$.

215: %

216: Our prediction strategy consists of determining those values of each

217: component $x_i$ of ${\mathbf x}_{(n-k+1,n)}$ for which the likelihood is

218: maximal.

219:

220: This strategy to identify the optimal precursor represents a rather fundamental choice.

221: %

222: In more applied examples one looks for precursors which minimize or maximize

223: more sophisticated quantities, e.g., discriminant functions or loss matrices.

224: %

225: These quantities are usually functions of the posterior PDF or the

226: likelihood, but they take into account the additional demands of the

227: specific problem, e.g., minimizing the loss due to a false prediction.

228: %

229: The strategy studied in this contribution is thus fundamental in the

230: sense that it enters into many of the more sophisticated quantities which

231: were used for predictions and decision making.

232: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

233: \subsection{Testing for predictive power\label{roc}}

234: A common method to verify a hypothesis or to test the quality of a prediction

235: is the receiver operating characteristic curve (ROC) \cite{Swets1, Egans, Pepe}.

236: %

237: The idea of the ROC consists simply of comparing the rate of correctly

238: predicted events $r_{c}$ with the rate of false alarms $r_{f}$ by plotting

239: $r_c$ vs.\ $r_f$.

240: %

241: The rate of correct predictions $r_c$ and the rate of false

242: alarms $r_f$ can be obtained by integrating the {\sl aposterior} PDFs $\rho({\mathbf

243:   x}_{(n-k+1,n)}|Y_{n+1}=1)$ and $\rho({\mathbf x}_{(n-k+1,n)}|Y_{n+1}=0)$ on the

244: precursory volume.

245: %

246: \begin{eqnarray}

247: r_c(\delta,{\mathbf x}^{pre}) & = & \int  \rho({\mathbf x}_{(n-k+1,n)}|Y_{n+1}=1)

248: dV^{pre}(\delta,{\mathbf x}^{pre}) \nonumber

249: \label{rcor}\\

250: \\

251: r_f(\delta,{\mathbf x}^{pre}) & = & \int  \rho({\mathbf x}_{(n-k+1,n)}|Y_{n+1}

252: =0) dV^{pre}(\delta,{\mathbf x}^{pre})\nonumber\\

253: \label{rf}

254: \end{eqnarray}

255: %

256:  Note that these rates are defined with respect to the total

257: numbers of events $Y_{n+1}=1$ and nonevents $Y_{n+1}=0$.

258: %

259: Thus the relative frequency of events has no direct influence on the ROC, unlike on other measures of predictability, as e.g., the Brier score or the ignorance\cite{bandi}.

260:

261: Plotting $r_c$ vs $r_f$ for increasing values of $\delta$ one obtains a curve

262: in the unit-square of the $r_f$-$r_c$ plane (see, e.g., Fig.\ \ref{fig:rocgauss}).

263: %

264: The curve approaches the origin for $\delta \rightarrow0$ and the point $(1,1)$ in the limit $\delta \rightarrow \infty$, where $\delta$ accounts for the magnitude of the precursory volume $V_{pre}(\delta)$.

265: %

266: The shape of the curve characterizes the significance of the prediction. A curve above the diagonal reveals that the corresponding strategy of prediction

267: is better than a random prediction which is characterized by the

268: diagonal. Furthermore we are interested in curves which converge as fast as

269: possible to $1$, since this scenario tells us that we reach the highest

270: possible rate of correct prediction without having a large rate of false

271: alarms.

272:

273: That is why we use the so-called {\it likelihood ratio} as a summary index, to

274: quantify the ROC.

275: %

276: For our inference problems the likelihood ratio

277: is identical to the slope $m$ of the ROC-curve at the vicinity of the origin

278: which implies $\delta \rightarrow 0 $.

279: %

280: This region of the ROC is in particular interesting, since it corresponds to a low rate of false alarms.

281: %

282: The term likelihood ratio results from signal detection theory.

283: %

284: In the context of signal detection theory, the term {\sl a posterior PDF} refers to the PDF, which we call likelihood in the context of predictions and vice versa.

285: %

286: This is due to the fact that the aim of signal detection is to identify a

287: signal which was already observed in the past, whereas predictions are made

288: about future events.

289: %

290: Thus the \lq\lq likelihood ratio" is in our notation a ratio of a posterior PDFs.

291: %

292: \begin{eqnarray}

293: m & = & \frac{\Delta r_c}{\Delta r_f} \sim  \frac{\rho({\mathbf x}^{pre}|Y_{n+1}=1)}{\rho({\mathbf x}^{pre}|Y_{n+1}=0)} + \mathcal{O}(\delta)\label{defm}.

294: \end{eqnarray}

295: %

296: However, we will use the common name likelihood ratio throughout the text.

297: For other problems the name likelihood ratio is also used for the slope at every point of the ROC.

298: %

299: Since we apply the likelihood ratio as a

300: summary index for ROC, we specify, that for our purposes the term likelihood ratio

301: refers only to the slope of the ROC curve at the vicinity of the

302: origin as in Eq.\ (\ref{defm}).

303:

304: %

305: Note, that one can show that the precursor, which maximizes the

306: likelihood as explained in Sec.\ \ref{pre} also maximizes the $m$ and is in

307: this sense the optimal precursor.

308: %

309: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%5

310: \subsection{Addressing the dependence on the event magnitude \label{evsize}}

311: We are now interested in learning how the predictability depends on the event magnitude

312: $\eta$ which is measured in units of the standard deviation of the time

313: series under study.

314: %

315: Thus the event variable $Y_{n+1}$ becomes dependent on the event

316: magnitude

317: \begin{eqnarray}

318: Y_{n+1}(\eta) & = &\left\{  \begin{array}{ll}

319: 1\quad&\mbox{ \small an event of magnitude $\eta$ or larger }\\

320: & \mbox{ occurred at time $n+1$} \\

321:  0\quad&\mbox{ \small no event of magnitude $\eta$ or larger}\\

322: &\mbox{  occurred at time $n+1$}\end{array} \right.

323: \end{eqnarray}

324:

325: Via Bayes' theorem the likelihood ratio can be

326: expressed in terms of the likelihood $L\bigl(Y_{n+1}(\eta)=1|{\mathbf

327:   x}_{pre}\bigr)$ and the total probability to find events

328: $P\bigl(Y_{n+1}(\eta)=1\bigr)$.

329: %

330: Inserting the technical details of the calculation of the likelihood and the

331: total probability (see the appendix) we can see that the likelihood

332: ratio depends sensitively on the joint PDF

333: $j(\mathbf{x}_{(n-k+1,n)},Y_{n+1}(\eta)=1)$ of pecursory variable and event.

334: %

335: \begin{widetext}

336: \begin{eqnarray}

337: m (Y_{n+1}(\eta),\mathbf{x}_{(n-k+1,n)})& = &\frac{\left(1 - \int_{-\infty}^{\infty} d\mathbf{x}_{(n-k+1,n)} \; j(\mathbf{x}_{(n-k+1,n)},Y_{n+1}(\eta)=1)\right)}{\int_{-\infty}^{\infty} d\mathbf{x}_{(n-k+1,n)} \; j(\mathbf{x}_{(n-k+1,n)},Y_{n+1}(\eta)=1)}

338: %\times \nonumber\\

339: %&\it{}&

340: \frac{  \frac{j(\mathbf{x}_{(n-k+1,n)},Y_{n+1}(\eta)=1)}{\rho(\mathbf{x}_{(n-k+1,n)})}

341: }{\left(1 -

342:   \frac{j(\mathbf{x}_{(n-k+1,n)},Y_{n+1}(\eta)=1)}{\rho(\mathbf{x}_{(n-k+1,n)})}\right)} \label{mgeneral}, \nonumber \\

343: \mbox{with} &\it{} & j(\mathbf{x}_{(n-k+1,n)},Y_{n+1}(\eta)=1) =

344: \int_{\mathcal{M}} \; dx_{n+1}\;

345: j(\mathbf{x}_{(n-k+1,n)},x_{n+1}),\quad \quad  \nonumber\\

346: & \it{}& \mathcal{M} = \{ x_{n+1}: Y_{n+1}=1 \},\\

347: \mbox{and} & \it{}&\quad \rho(\mathbf{x}_{(n-k+1,n)}) =

348:   j(\mathbf{x}_{(n-k+1,n)},Y_{n+1}(\eta)=1) +

349:   j(\mathbf{x}_{(n-k+1,n)},Y_{n+1}(\eta)=0). \nonumber

350: \end{eqnarray}

351: \end{widetext}

352: %

353: Hence once the precursor is chosen, the dependence on the event

354: magnitude $\eta$ enters into the likelihood ratio, via the joint PDF of event and precursor.

355: %

356: Looking at the rather technical formula in Eq.\ (\ref{mgeneral}), there are

357: two aspects, which we find remarkable:

358: \begin{itemize}

359: \item[(I)] The slope of the ROC curve is fully characterized by the knowledge of

360:   the joint PDF of precursory variable and event.

361:   %

362:  This implies that in the framework of

363:   statistical predictions all kinds of (long-range) correlations which might be

364:   present in the time series influence the quality of the predictions

365:   only through their influence on the joint PDF.

366: %

367: \item[(II)] The definition of the event, e.g., as a threshold crossing or an

368: increment does change this dependence only insofar as it enters into the

369: choice of the

370: precursor and it influences also the set on which the integrals in Eq.\ (\ref{mgeneral}) are

371: carried out.

372: %

373: Both $Y_{n+1}(\eta)$ and the set $\mathcal{M}$ have

374: to be defined according to the type of events one predicts.

375: %

376:  When predicting, e.g., increments $x_{n+1} - x_n \geq \eta $ via the precursory variable $x_n$, then $\mathcal{M}=[a,b]$ with

377: $ a(Y_{n+1}(\eta)) = x_n + \eta$ for the lower border and $ b(Y_{n+1}(\eta)) =

378: \infty$ for the upper border.

379: %

380: In order to predict threshold crossings at $x_{n+1}$ via $x_n$ one uses  $ a(Y_{n+1}(\eta)) = \eta$, $ b(Y_{n+1}(\eta)) = \infty$.

381: \end{itemize}

382: %

383:

384: Exploiting Eq.\ (\ref{mgeneral}) we can hence determine the dependence of the

385: likelihood ratio and the ROC curve on the events magnitude $\eta$, via the

386: dependence of the joint PDF of the process under study.

387: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

388: \subsection{Constraint for increasing quality of predictions with increasing

389: event magnitude \label{constr}}

390: %

391: In order to study the dependence of the likelihood ratio on the event magnitude we are going to introduce a constraint which the likelihood and the total

392: probability to find events have to fulfill in order to find a better

393: predictability of larger (smaller) events.

394:

395: In order to improve the readability of the paper, we will first introduce

396: the following notations for %

397: the aposterior PDFs, the likelihood and the total probability to find events

398: \begin{eqnarray}

399: \rho_c({\mathbf x}_{(n-k+1,n)}, \eta) & = & \rho({\mathbf

400:   x}_{(n-k+1,n)}|Y_{n+1}(\eta)=1), \; \\

401: \rho_f ({\mathbf x}_{(n-k+1,n)}, \eta) & = &  \rho({\mathbf

402:   x}_{(n-k+1,n)}|Y_{n+1}(\eta)=0),\; \\

403: L (\eta, {\mathbf x}_{(n-k+1,n)})& = & L(Y_{n+1}(\eta)=1|{\mathbf

404:   x}_{(n-k+1,n)}), \;\\

405: P(\eta) & = & P(Y_{n+1}(\eta)=1).

406: \end{eqnarray}

407:  We can then ask for the change of the

408: likelihood ratio with changing event magnitude $\eta$.

409: \begin{equation}

410: \frac{\partial}{\partial \eta}\, m(Y_{n+1}(\eta),\mathbf{x}_{(n-k+1,n)})  \gtreqqless0.

411: \end{equation}

412: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

413: The derivative of the likelihood ratio is positive (negative, zero), if the following sufficient condition $c(\eta)$ is fulfilled.

414: \begin{widetext}

415: \begin{equation}

416: c(\eta, {\mathbf x}_{(n-k+1,n)}) = \frac{\partial}{\partial \eta}\ln L(\eta,{\mathbf x}_{(n-k+1,n)})-

417: \frac{\bigl(1-L(\eta,{\mathbf x}_{(n-k+1,n)}))\bigr)}{\bigl(1-P(\eta)\bigr)}\;\frac{\partial}{\partial

418: \eta}\ln P(\eta) \gtreqqless 0. \label{c2b}

419: \end{equation}

420: \end{widetext}

421: Hence one can tell for an arbitrary process, if extreme events are better

422: predictable, by simply  testing, if the marginal PDF of the event and the

423: likelihood of event and precursor fulfill Eq.\ (\ref{c2b}).

424:

425: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

426: \section{Predictions of Increments in i.i.d.\ random numbers \label{num}}

427: %

428: In this section we test the condition $c(\eta,{\mathbf x}_{(n-k+1,n)})$ as given in Eq.\ (\ref{c2b}) for increments in Gaussian, power-law,

429:  and exponentially distributed i.i.d.\ random numbers.

430: %

431:  We thus concentrate on extreme events which consist of a sudden increase (or decrease) of the observed variable within a

432: few time steps.

433: %

434: Examples of this kind of extreme events are the increases in

435: wind speed in \cite{Physa,Euromech}, but also stock market

436: crashes \cite{stocks,Sornette2} which consist of sudden decreases.

437:

438: We define our extreme event by an increment $x_{n+1}-x_n$

439: exceeding a given threshold $\eta$

440: %

441: \begin{equation}

442:  x_{n+1}-x_n \geq   \eta, \label{e0}

443: \end{equation}

444: %

445: where $x_{n}$ and $x_{n+1}$ denote the observed values at two consecutive time

446: steps and the event magnitude $\eta$ is again measured in units of the standard deviation.

447:

448:  Since the first part of the increment $x_n$ can be used as a precursory variable, the definition of the event as an increment introduces a correlation between the event and the precursory variable

449: $x_n$.

450: %

451: Hence the prediction of increments in random numbers provides a simple, but not unrealistic example which allows us to study the influence of the

452:  distribution of the underlying process on the event-magnitude dependence of the

453:  quality of prediction.

454:

455: In the examples which we study in this section the joint PDF of

456: precursory variable and event is known and we can hence evaluate $c(\eta,x_n)$

457: analytically.

458: %

459: A mathematical expression for a filter which selects the PDF of our

460: extreme events out of the PDFs of the underlying stochastic process can be

461: obtained through applying the  Heaviside function $ \Theta( x_{n+1} - x_{n}

462: -\eta)$ to the joint PDF.

463: %

464: This method is described in more detail in the appendix.

465:

466: Since in most cases the structure of the PDF is not known analytically, we are also

467: interested in evaluating

468: $c(\eta,x_n) $ numerically.

469: %

470: In this case the approximations of the total probability and the likelihood are obtained by \lq\lq binning and counting" and

471: their numerical derivatives are evaluated via a

472: Savitzky-Golay filter \cite{Savgol,numrecipes}.

473: %

474: The numerical evaluation is done within $10^7$ data points. In order to check

475: the stability of this procedure, we evaluate $c(\eta,x_n)$ also on $20$

476: bootstrap samples which are generated from the original data set.

477: %

478: These bootstrap samples consist of $10^7$ pairs of event and precursory variable, which were drawn randomly from the

479: original data set.

480: %

481: Thus their PDFs are slightly different in their first and second

482: moment and they contain different numbers of events.

483: %

484: Evaluating $c(\eta, x_n)$ on the bootstrap samples thus shows how

485: sensitive our numerical evaluation procedure is towards changes in the

486: numbers of events. This is especially important for large and therefore rare

487: events.

488:

489: In order to check the results obtained by the evaluation of $c(\eta,x_n) $, we

490: compute also the corresponding ROCs analytically and numerically.

491:

492: Note that for both, the numerical evaluation of the condition and the

493: ROC curves, we used only

494: event magnitudes $\eta$ for which we found at least $1000$ events, so that the

495: observed effects are not due to a lack of statistics of the large events.

496: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

497: \subsection{Gaussian distributed random numbers \label{Gaussian}}

498: %%%%%%%%%%%%%%%

499: \begin{figure}

500: \epsfig{file=figure1.eps,width=8cm}

501: \caption{The condition $c(\eta,x_n)$ for the

502:   Gaussian distribution as given by Eq.\ (\ref{cgauss}).

503: %

504: The color shaded regions indicate the intervals $[-\sigma \eta, -\eta/2]$ for

505: which we can according to Eq.\ \ref{gausseins} expect

506: $c(\eta,x_n)$ to be positive.

507: %

508: If $x_n < -\sigma \eta $, $\eta >

509: 2\sqrt{\pi}$ and terms of the order of $\exp(-(x_n + \sigma \eta)^2)$ are

510: sufficiently small, the condition is also positive.

511: %

512: If terms of the order of $\exp(-(x_n + \sigma \eta)^2)$ cannot be neglected one also might find

513: small regions in $(-\infty, -\sigma \eta]$ for which $c(\eta,x_n) <0$.

514: %

515: However, the influence of these regions

516: is neglectable, since our

517: alarm interval is defined as $[-\infty,\delta]$ which implies an averaging

518: over several possible values of the precursory variable. \label{fig:cgauss}

519: }

520: \end{figure}

521: %%%%%%%%%%%%%%%%

522: In the first example we assume the sequence of i.i.d.\ random numbers which

523: form our time series to be normal distributed.

524: %

525: As we know from \cite{Sarah2}, increments

526: within Gaussian random numbers are better predictable, the more extreme

527: they are.

528: %

529: In this section we will show that their PDFs fulfill also the

530: condition in Eq.\ (\ref{c2b}).

531: Applying the filter mechanism developed in the appendix we obtain the following expressions for the a posteriori PDFs

532: \begin{equation}

533: \rho_c(x_n,\eta) = \frac{\exp\left(-\frac{x_{n}^2}{2\sigma^2}\right)}{2\sqrt{2\pi}\sigma

534:   P(\eta)}\mbox{erfc}\left(\frac{x_{n}+\sigma\eta}{\sigma\sqrt{2}}\right),\label{marga}

535: \end{equation}

536: and the likelihood

537: \begin{equation}

538: L(\eta,x_n)  =  \frac{1}{2}\erfc\left(\frac{x_n +

539:   \sigma\eta}{\sigma\sqrt{2}}\right). \label{gausslikely}

540: \end{equation}

541: %

542: We recall that the optimal precursor is given by the value of $x_n$ which

543: maximizes the likelihood. %

544: We refer to this special value of the variable $x_n$ by $x_{pre}$ and find for

545: the likelihood according to Eq.\ (\ref{gausslikely}) $x_{pre}=-\infty$.

546: %

547: Thus, instead of a finite alarm volume $\delta$

548: here is the upper limit of the interval $[-\infty, \delta ]$.

549: %

550: %

551: The total probability to find increments of magnitude $\eta$ is given by

552: %

553: \begin{equation}

554: P(\eta) = \frac{1}{2} \erfc(\eta/2).\label{gaussptotal}

555: \end{equation}

556: %%%%%%%%%%%%%%%

557: \begin{figure}

558: \epsfig{file=figure2.eps ,width=8cm}

559: \caption{Comparison of the numerically

560:   evaluated condition $c(\eta,x_n)$ for the Gaussian

561: distribution and the expression given by Eq.\ (\ref{cgauss}).

562: %

563: The black curves denote the evaluation of the analytic result in Eq.\

564:   (\ref{cgauss}), the curves plotted with lines and symbols represent the numerical results

565:   obtained from the original data set, and the dashed lines represent the

566:   results obtained from the corresponding bootstrap samples.

567: %

568: The gray (green in the colored plot) regions indicate the regime $-\sigma \eta

569: < x_n < -\sigma \eta /2$ for which $c(\eta, x_n)$ is positive in the limit

570: $\eta \rightarrow \infty$.

571: %

572: The numerical

573: evaluation of $c(\eta,x_n)$ was done by sampling the likelihood and the total

574: probability of events from $10^7$ random numbers.\label{fig:cnumgauss}}

575: \end{figure}

576: %%%%%%%%%%%%%%%%

577: Hence the condition in Eq.\ (\ref{c2b}) reads

578: \begin{eqnarray}

579: c(\eta,x_n) & = &-\sqrt{\frac{2}{\pi}}\frac{\exp\left(-z^2\right)}

580: {\erfc\left(z\right)} \quad  \nonumber\\

581: &\it{}& + \frac{1}{\sqrt{\pi}} \frac{\exp\left(-\frac{\eta^2}{4}\right)}{\erfc\left(\frac{\eta}{2}\right)}

582: \frac{\left(1 -\frac{1}{2}\erfc\left(z\right) \right)}

583: {\left( 1- \frac{1}{2} \erfc\left(\frac{\eta}{2}\right)\right)}, \nonumber\\

584: &\it{}& \mbox{with}\quad z=\frac{x_n + \sigma \eta}{\sqrt{2}\sigma}  \label{cgauss}

585: \end{eqnarray}

586: Fig.\ \ref{fig:cgauss} illustrates this expression

587: and Figure \ref{fig:cnumgauss} compares it to the numerical results.

588: %

589: For the ideal precursor $x_{n} = x_{pre} = -\infty $ the condition $c(\eta,x_n)$ is

590: ---according to Eq.\ (\ref{cgauss})---

591: zero, since in this case,

592: the slope of the ROC-curve tends to infinity \cite{Sarah2} and does not react to any

593: variation in $\eta$.

594: %

595: For any finite value of the precursory variable $x_{n} <0$ we have to distinguish three regimes

596: of $z=(x_n + \sigma \eta)/\sqrt{2}\sigma$, namely, $z \rightarrow \infty$ or $z

597: \rightarrow -\infty$ and finally also the case $z=0$.

598:

599:

600: In the first case we study the

601: behavior of $c(\eta,x_n)$ for a fixed value of the precursory variable $-\sigma \eta <x_n$ and $\eta

602: \rightarrow \infty$.

603: %

604: This implies that $z\rightarrow \infty$ and we can use

605: the asymptotic expansion for large arguments of the complementary

606: error function

607: %%%%%%%%%%%%

608: \begin{figure}[t!!!]

609: \includegraphics[width=9cm]{figure3.eps}

610: \caption[]{\small\label{fig:rocgauss} ROCs for Gaussian distributed i.i.d.\ random variables.

611: %

612: The symbols

613: represent ROC curves which where made via predicting increments in $10^7$ normal

614: i.i.d.\ random numbers.

615: %

616:  The predictions were made according to the prediction

617: strategy described in Sec.\ \ref{pre}. The lines represent the results of evaluating

618: the integrals in Eqs.\ (\ref{rcor}) and (\ref{rf}) for the Gaussian case. Note that the quality of the

619: prediction increases with increasing event magnitude.}

620: \end{figure}

621: %%%%%%%%

622: %

623: \begin{eqnarray}

624: \erfc(z) &\sim &  \frac{\exp(-z^2)}{\sqrt{\pi}z} \left(1 + \sum_{m=1}^{\infty}

625: (-1)^m \frac{1 \cdot 3 ...(2m-1)}{(2z^2)^m }\right),\nonumber\\

626: &\it{}& \left( z \rightarrow \infty, |

627: \mbox{arg} z | < \frac{3\pi}{4} \right)\label{erfcapprox}

628: \end{eqnarray}

629: %

630: which can be found in \cite{Abram} to obtain

631: %

632: \begin{eqnarray}

633: c(\eta, x_n) & \propto & -\frac{x_n}{\sigma} + \frac{\eta}{2},\quad -\sigma \eta < x_n <0. \label{gausseins}

634: \end{eqnarray}

635: %

636: This expression is appropriate for $x_n > -\sigma \eta$ since the

637: asymptotic expansion in Eq.\ (\ref{erfcapprox}) holds only if the argument of

638: the complementary error function is positive.

639: %

640: In this case $c(\eta, x_n)$ is larger than zero, if $x_n$ is fixed and finite and $-\sigma \eta < x_n < -\sigma \eta/2$.

641: %

642:

643: In the second case, we assume $\eta \gg 1$ to be fixed, $x_n <-\sigma \eta$ and $x_n

644: \rightarrow -\infty$.

645: %

646: Hence we can use the expansion in Eq. (\ref{erfcapprox}) only to obtain

647: the asymptotic behavior of the dependence on $\eta$ and not for the dependence

648: on $z$.

649: %

650: An asymptotic expression of $c(\eta, x_n)$ hence reads

651: %

652: \begin{eqnarray}

653: c(\eta, x_n) & \propto & \frac{\eta}{2\left(1 - \frac{1}{2}

654:   \erfc\left(\eta/2\right) \right)} \left(\frac{\erf(z)}{\sqrt{pi}} +

655:   \frac{\eta}{2} \right) \nonumber\\

656:  &\it{}& - \mathcal{O}\left(\exp(-z^2) \right), \quad x_n < -\sigma

657:   \eta. \label{gausszwei}

658: \end{eqnarray}

659: %

660: Since $\erf(z)$ tends to minus unity as $z \rightarrow -\infty$ the expression

661: in Eq. (\ref{gausszwei}) is positive if $\eta > 2\sqrt{\pi}$ and if we can

662: assume the squared exponential term to be sufficiently small.

663: %

664: If the later assumption is not fullfilled one might observe some regions of

665: intermediate values of $-\infty< x_n < -\sigma \eta$, for which $c(\eta,x)$ is

666: negative.

667:

668: However the ROC curves in Fig.\ \ref{fig:rocgauss} suggest that the influence

669: of these regions is sufficiently small, if the alarm volume is chosen to be $[-\infty, \delta ]$.

670: %

671: We can understand this effect, if we keep in mind that we use the interval

672: $[-\infty, \delta]$ as an alarm volumen.

673: %

674: Hence we can expect that the influence of the regions, where $c(\eta, x_n)$ is negative, is suppressed since we average over many different values of $x_n$ and the condition is positive as $x_n \rightarrow -\infty$.

675: %

676: (Positive is meant here

677: in the sense, that $c(\eta,x_n)$ approaches the value zero for $x_n = -\infty$

678: from small positive numbers.)

679:

680:

681: In the third case, for $x_n = -\sigma \eta$ and hence $z=0$ we find that $c(\eta,x_n)$ is

682: positive if $\eta > 2\sqrt{\frac{2}{\pi}} \left(1 - \frac{1}{2}\erfc(\eta/2) \right)$.

683: %

684:

685: In total we can expect larger increments in Gaussian random numbers to be easier

686: to predict the larger they are.

687: %

688: The ROCs in Fig.\ \ref{fig:rocgauss} support these results.

689:

690: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

691:

692: \subsection{Symmetrized exponential distributed random variables

693:   \label{symexpo}}

694: The PDF of the symmetrized exponential reads

695: \begin{eqnarray}

696: \rho(x) & = & \frac{\lambda}{2} \exp(-\lambda |x_n|) = \left\{ \begin{array}{l@{\quad:\quad}l} \frac{\lambda}{2} \exp(-\lambda x_n) & x_n

697: >0, \\ \lambda/2 &  x_n =0,\\ \frac{\lambda}{2} \exp(\lambda x_n) & x_n

698: <0, \end{array} \right.  \nonumber

699: \end{eqnarray}

700: %%the factor $\frac{1}{2}$ comes from normalization over the whole real axis.

701: with $\mu =0$, $\sigma = \sqrt{2}/\lambda$.

702:

703: Applying the filtering mechanism

704: according to the appendix we find the joint PDFs of precursory variable and event

705: \begin{widetext}

706: \begin{eqnarray}

707: j(x_n,(Y_{n+1}(\eta)=1)) & = & \left\{ \begin{array}{l@{\quad :\quad}l}

708: \frac{\lambda}{4}\exp(-\sqrt{2} \eta -2\lambda x_n) & x_n >0,\\

709: \frac{\lambda}{4}\exp(-\sqrt{2}\eta) & -\eta <x_n <0,\\

710:  \frac{\lambda}{2} \left(\exp(\lambda x_n) -\frac{1}{4}\exp\left(\sqrt{2}\eta

711:      +\lambda 2x_n \right) \right) & x_n < - \eta <0,\\

712: \end{array} \right.

713: \end{eqnarray}

714: \end{widetext}

715: %

716: \begin{widetext}

717: the aposterior probabilities,

718: \begin{eqnarray}

719: \rho_c(x_n, \eta, \lambda) & = & \left\{\begin{array}{l@{\quad :\quad}l}

720:   \frac{\lambda}{(2 + \sqrt{2}\eta}\exp( -2\lambda x_n) & x_n >0,\\

721: \frac{\lambda}{(2+\sqrt{2}\eta}& -\eta <x_n <0,\\

722:  \frac{\lambda}{(2+ \sqrt{2}\eta} \left(2\exp(\sqrt{2}\eta + \lambda x_n) -\exp\left(2\sqrt{2}\eta

723:      +2\lambda x_n \right) \right) & x_n < - \eta <0,\\

724: \end{array} \right. \label{symexpapostc}\\

725: %

726: \rho_f(x_n,\eta, \lambda) & = & \left\{\begin{array}{l@{\quad :\quad}l}

727:   \frac{\lambda}{2}\exp( -\lambda x_n) \frac{\left(1 - \frac{1}{2}

728:   \exp(-\lambda x_n - \sqrt{2}\eta) \right)}{\left(1 -

729:   \frac{1}{2}\left( 1 + \frac{\eta}{2}\right)  \exp(-\sqrt{2}\eta) \right)} & x_n >0,\\

730:  \frac{\lambda}{2}\exp( \lambda x_n) \frac{\left(1 - \frac{1}{2}

731:   \exp(-\lambda x_n - \sqrt{2}\eta) \right)}{\left(1 -

732:   \frac{1}{2}\left( 1 + \frac{\eta}{2}\right)  \exp(-\sqrt{2}\eta) \right)} & -\eta <x_n <0,\\

733:  \frac{\lambda}{4} \frac{\exp(2\lambda x_n + \sqrt{2}\eta)}{\left(1 -

734:   \frac{1}{2}\left( 1 + \frac{\eta}{2}\right)  \exp(-\sqrt{2}\eta) \right)} & x_n < - \eta <0,\\

735: \end{array} \right. \label{symexpapostf}

736: \end{eqnarray}

737: \end{widetext}

738: \begin{widetext}

739: the likelihood

740: \begin{eqnarray}

741:  L(\eta,x_n, \lambda) & = & \left\{ \begin{array}{l@{\quad :\quad}l}

742: \frac{1}{2}\exp(-\sqrt{2} \eta -\lambda x_n) & x_n >0,\\

743: \frac{1}{2}\exp(-\sqrt{2}\eta - \lambda x_n) & -\eta <x_n <0,\\

744:  1 -\frac{1}{2}\exp(\sqrt{2}\eta  +\lambda x_n ) & x_n < - \eta <0,\\  \end{array} \right. \label{symexplike}

745: \end{eqnarray}

746: \end{widetext}

747: %

748: and the total probability to find events of magnitude $\eta$

749: %\begin{widetext}

750: \begin{eqnarray}

751: P(\eta) & = & \left\{ \begin{array}{l@{\quad :\quad}l}

752: \frac{1}{8}\exp(-\sqrt{2} \eta) & x_n >0,\\

753: \frac{\sqrt{2}}{4}\eta \exp(-\sqrt{2}\eta) & -\eta <x_n <0,\\

754: \frac{3}{8}\exp(-\sqrt{2}\eta) & x_n < - \eta <0.  \end{array}\right. \quad

755: \end{eqnarray}

756: %\end{widetext}

757: If we are not interested in the range of the precursory variable, the total probability

758: to find events is given by

759: \begin{equation}

760: P(\eta)  =  \frac{1}{2}\exp(-\sqrt{2}\eta) \left(1 + \frac{\eta}{\sqrt{2}} \right).

761: \end{equation}

762: %

763: Hence the condition $c(\eta, x_n, \lambda )$ reads

764: \begin{widetext}

765: %

766: %%%%%%%%%%%%%

767: \begin{figure*}[t!!!]

768: \parbox{18cm}{

769: %\setlength{\fboxsep}{}

770: \parbox{8.5cm}{

771: \epsfig{file=figure4.eps, width=8cm}

772: \caption{The numerically and analytically evaluated condition

773:   for the symmetrized exponential. The black line is the result of the

774:   analytical evaluation according to  Eq.\ (\ref{csymexp}), the

775:   curves plotted with lines and symbols represent the numerical results

776:   obtained from the original data set, and the dashed lines represent the

777:   results obtained from the corresponding bootstrap samples. Note that for

778:   small values of $x_n$ the condition $c(\eta, x_n, \lambda)$ is for all values of $\eta$ close

779:   to zero.\label{fig:csymexp}}}

780: %\end{figure}

781: %%%%%%

782: \hspace*{0.5cm}

783: %%%%%%%%%%%%%

784: %\begin{figure}[t!!!]

785: \parbox{8.5cm}{

786: \epsfig{file=figure5.eps,width=8cm}

787: \caption{The ROCs for symmetrically exponentially

788:   distributed i.i.d.\ random numbers show no significant dependence on the

789:   event magnitude.  The ROC curves were made via

790:   predicting increments in $10^7$ normal i.i.d.\ random numbers and the predictions were made according to the prediction

791: strategy described in Sec.\ \ref{pre}. The black

792:   line indicates the analytically evaluated ROC curve for

793:   $\eta=0$. \label{fig:rocsymexp}}

794: }

795: }

796: \end{figure*}

797: %%%%%%%%%%%%%%

798: \begin{eqnarray}

799: c(\eta, x_n,\lambda) & = & \left\{\begin{array}{l}

800: -\sqrt{2}\left(1 - \frac{\left(1 - \frac{1}{2} \exp(-\sqrt{2}\eta - \lambda x_n)\right)}{\left(1 - \frac{1}{8}\exp(-\sqrt{2}\eta)\right)}\right), \quad x_n >0,\\

801: \\

802: -\sqrt{2} + \frac{(1 -\sqrt{2}\eta)}{\eta} \frac{\left(1- \frac{1}{2}\exp(-\sqrt{2}\eta

803:   -\lambda x_n)\right)}{\left(1 -

804:   \frac{\sqrt{2}}{4}\eta \exp(-\sqrt{2}\eta)\right)}, \quad -\eta<x_n<0,\\

805: \\

806: -\frac{1}{\sqrt{2}}\exp(\lambda x_n + \sqrt{2}\eta)\left(\frac{1}{1 -

807:   \frac{1}{2}\exp(\sqrt{2}\eta + \lambda x_n)} + \frac{1}{1 -

808:   \frac{3}{8}\exp(-\sqrt{2}\eta)} \right),\quad x_n < - \eta.

809:  \end{array}\right.  \label{csymexp}

810: \end{eqnarray}

811: \end{widetext}

812:

813: Figure \ref{fig:csymexp} compares the

814: results of the numerical evaluation of the condition and the analytical

815: expression given by Eq.\ (\ref{csymexp}). Since most precursors of large

816: increments can be found among negative values, the numerical evaluation of

817: $c(\eta,x_n\lambda)$  becomes worse for positive values of $x_n$, since in

818: this limit the likelihood is not very well sampled from the data. This leads

819: also to the wide spread of the bootstrap samples in this region.

820:

821: Figure \ref{fig:csymexp} shows that in the vicinity of the smallest value of the data set, the condition

822: $c(\eta,x_n,\lambda)$ is zero. As we approach larger values of $\eta$,

823: $c(\eta,x_n,\lambda)$ approaches zero in the whole range of data values. That

824: is why we would expect to see no influence of the event magnitude on the quality of predictions in the exponential case.

825:

826: The ROC curves in Fig. \ref{fig:rocsymexp} support these results. The

827: numerical ROC curves were made via predicting increments in $10^7$ normal

828: i.i.d.\ random numbers according to the prediction

829: strategy described in Sec. \ref{pre}.

830: %

831: The precursor for the ROC-curves is chosen as the maximum of the likelihood

832: according to Eq.\ (\ref{symexplike}), i.e., $x_{pre}=-\infty$, so that the

833: alarm interval is $[\infty,\delta]$.

834: %

835: In summary there is no significant dependence on the event magnitude for the prediction

836: of increments in a sequence of symmetrical exponential distributed random numbers.

837: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

838: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

839: \subsection{Pareto distributed random variables \label{powl}}

840: %%%%%%%%%%%%%%%%

841: \begin{figure}[t!!!]

842: \includegraphics[width=6cm,angle=-90]{figure6.eps}

843: \caption[]{\small The condition $c(\eta,x_n,x_{min},k)$ for the power-law distribution with lower endpoint

844: $x_{min} =0.01$ are plotted for constant values of the precursory variable

845: $x_{n}$. The symbols represent the results of the numerical evaluation of

846: $c(\eta,x_n,x_{min},k)$, the gray (colored) lines denote the analytic results,

847: and the black lines denote the result for the corresponding bootstrap samples

848: and the optimal precursor.

849: %

850: For the \lq\lq ideal" precursor $x_{n}=x_{min}=0.01$ all

851: values of $c(\eta,k, 0.01)$ are negative. Hence one should expect smaller

852: events to be better predictable. However, this effect is sensitive of the

853: choice of the precursor. \label{fig:powerlawcompare}}

854: %%%%%%%%%%%%%%%%}

855: \end{figure}

856: We investigate the Pareto distribution as an example for power-law

857: distributions.

858: %

859: The PDF of the Pareto distribution is defined as \cite{Feller}

860: \begin{equation}

861: \rho(x) = kx_{min}^k \, x^{-(k+1)}

862: \end{equation}

863: for  $x \in [x_{min}, \infty)$ with the exponent $k\geq3$, the lower endpoint

864: $x_{min} > 0$, and variance $\sigma = \frac{x_{min}}{k-1}\sqrt{\frac{k}{k-2}}$.

865: Filtering for increments of magnitude $\eta$

866: we find the following conditional PDFs of the increments:

867: \begin{eqnarray}

868: \rho_c ( x_n, \eta, x_{min},k)& = & \frac{k x_{min} ^{2k}}{ x_n^{k+1} \left(x_n +

869:  \frac{x_{min}}{k+1}\sqrt{\frac{k}{k-2}}\, \eta \right)^k P(\eta,k)}

870: \label{paretoapost}\nonumber\\

871: \\

872: %

873: \rho_f (x_n, \eta, x_{min},k)& = & \frac{kx_{min}^k}{x_n^{k+1}} \frac{\left(1 -

874:   \left(\frac{x_{min}}{x_n +  \frac{x_{min}}{k+1}\sqrt{\frac{k}{k-2}} \,\eta

875:   }\right)^k\right)}{1 - P(\eta,k)}\nonumber\\

876: \\

877: %

878: L (\eta, x_n, x_{min},k)& = &\left( \frac{x_{min}}{x_n +

879:   \frac{x_{min}}{k+1} \sqrt{\frac{k}{k-2} \, \eta}}\right)^k .

880: \label{paretolikeli}

881: \end{eqnarray}

882: Within the range $(x_{min}, \infty)$ the likelihood has no well defined

883: maximum. However, since the likelihood is a monotonously decreasing

884: function, we use the lower endpoint $x_{min}$ as a precursor.

885: The total probability to find events of magnitude $\eta$ is given by

886: %\begin{widetext}

887: \begin{eqnarray}

888: P(\eta,k)& = & \frac{1}{2} \,{}_{2}F_1

889: \left(k, 2k, 2k+1,- \frac{\eta}{(k+1)}\sqrt{\frac{k}{k-2}}\right), \nonumber \\\label{paretototal}

890: \end{eqnarray}

891: where $ {}_{2}F_1(a,b,c,x) $ denotes  the hypergeometric function $ {p}_{2}F_q(a,b,c,x)$ with

892: $p=2$, $q=1$.

893: %\end{widetext}

894:

895: %%%%%%%%%%%%%%%%

896: \begin{figure}[t!!!]

897: \includegraphics[width=6cm, angle= -90]{figure7.eps}

898: \caption[]{\small\label{fig:powlroc3}ROC-plot for the power-law distribution with $k=3$ and $x_{min} =0.01$. The

899: symbols show the numerical results and the lines indicate the analytically

900: calculated ROC curves.

901: The ROC curves were made via predicting increments in $10^7$ Pareto

902: distributed i.i.d.\ random numbers. The predictions were made according to

903: the prediction strategy described in Sec.\ \ref{pre}. Note that we tested only

904: event magnitudes $\eta$, for which we found at least $1000$ events, so that the

905: effects we observe are not due to a lack of statistics of the large events. The ROC curves display

906: that in Pareto distributed i.i.d.\ random numbers with the lower endpoint

907: $x_{min}=0.01$ smaller events are better predictable and that large events are

908: very hard to predict.}

909: \end{figure}

910: %%%%%%%%%%%%%%%

911:

912: %%%%%%%%%%%%%%%%

913: %\begin{figure}[t!!!]

914: %\includegraphics[width=6cm, angle= -90]{figure8.eps}

915: %\caption[]{\small\label{fig:powlroc6} (Color online)

916: %ROC-plot for the power-law distribution with $k=6$ and $x_{min} =0.01$. The

917: %symbols show the numerical results, the lines indicate the analytically

918: %calculated ROC-curves.

919: %The ROC-curves where made via predicting increments in $10^7$ Pareto distribute%d

920: %i.~i.~d.\ random numbers. The predictions were made according to the prediction%

921: %strategy described in Sec.\ \ref{pre}. The ROC-curves show

922: %that in Pareto distributed i.~i.~d.\ random numbers with the lower endpoint

923: %$x_{min}=0.01$ smaller events are better predictable and that large events are %especially

924: %hard to predict. }

925: %\end{figure}

926: %%%%%%%%%%%%%%%

927:

928: Using

929: \begin{eqnarray}

930: \ \frac{\partial P(\eta, k)}{\partial \eta} & = & \frac{k}{\eta} \left(

931: \frac{1}{\left(1 + \frac{\eta}{k+1}\sqrt{\frac{k}{k-2}} \right)^k} - 2P(\eta,k)\right) \nonumber\\

932: \end{eqnarray}

933: and inserting the expressions (\ref{paretolikeli}) and (\ref{paretoapost}) for the components of

934: $c(\eta,x_n, x_{min},k)$ we can obtain an explicit analytic expression for

935: the condition. In Fig.\ \ref{fig:powerlawcompare} we evaluate this expression using {\it

936:   Mathematica} and compare it with the results of an empirical evaluation on

937: the data set of $10^7$ i.i.d.\ random numbers.

938:

939: Figure (\ref{fig:powerlawcompare}) displays that the value

940: of $c(\eta,x_n, x_{min},k)$ depends sensitively on the choice of the

941: precursor.

942: %

943:  For the ideal precursor $x_{pre}=x_{min}$ all values of $c(\eta,k, x_{min})$ are negative.

944: %

945: Hence one should in this case expect smaller

946: events to be better predictable.

947: %

948: The corresponding ROC curves in Figs.\

949: \ref{fig:powlroc3}, %%\ref{fig:powlroc6}

950: and \ref{fig:powlroc9} verify this

951: statement of $c(\eta,x_n, x_{min},k)$.

952:

953: In summary we find that larger events in Pareto distributed i.~i.~d.\ random

954: numbers are harder to predict the larger they are. This is an admittedly

955: unfortunate result, since extremely large events occur much more frequently in

956: power-law distributed processes than in Gaussian distributed processes. Hence,

957: their prediction would be highly desirable.

958: %%%%%%%%%%%%%%%%

959: \begin{figure}[t!!!]

960: \includegraphics[width=6cm, angle= -90]{figure9.eps}

961: \caption[]{\small\label{fig:powlroc9}ROC-plot for the power-law distribution with $k=9$ and $x_{min} =0.01$.

962: %

963: The symbols show the numerical results and the lines indicate the analytically

964: calculated ROC curves.

965: %

966: The ROC curves where made via predicting increments in $10^7$ Pareto distributed

967: i.i.d.\ random numbers and the predictions were made according to the prediction

968: strategy described in Sec.\ \ref{pre}.

969: %

970: The ROC-curves display that in Pareto distributed i.i.d.\ random numbers smaller events are better

971: predictable and large events are especially hard to predict.}

972: \end{figure}

973: %%%%%%%%%%%%%%%

974:

975: \section{Increments in Free Jet Data \label{freejet}}

976: In this section, we apply the method of statistical inference to predict

977: acceleration increments in free jet data.

978: %

979: Therefore we use a data set of

980: $1.25\times 10^7$ samples of the local velocity measured in the turbulent

981: region of a round free jet \cite{Peinke}.

982: %

983: The data were sampled by a hot-wire

984: measurement in the central region of an air into air free jet.

985: %

986: One can then calculate the PDF of velocity increments  $a_{n,k}=v_{n+k} -v_n$, where $v_n$ and

987: $v_{n+k}$ are the velocities measured at time step $n$ and $n+k$. The Taylor

988: hypothesis allows one to relate the time-resolution to a spatial

989: resolution \cite{Peinke}.

990: %

991: One observes that for large values of $k$ the PDF of increments is essentially indistinguishable

992: from a Gaussian, whereas for small $k$, the PDF develops approximately

993: exponential wings \cite{vanAtta, Frisch1990, Frisch1995}.

994: %

995: Fig.\ \ref{fjhisto} illustrates this effect using the data set under study.

996: %%%%%%%%%%%%%%%%%%%%

997: \begin{figure}

998: \epsfig{file=figure10.eps,width=8cm}

999: \caption{\label{fjhisto}PDF of the increments $a_{n,k}=v_{n+k}-v_n$ with $k=

1000:   1,3,10,35,144,285$. The black lines correspond to Gaussian and exponential

1001:   PDFs with appropriate values for the standard deviation or the coefficient $\lambda$.}

1002: \end{figure}

1003: %%%%%%%%%%%%%%%

1004: %%%%%%%%%%%%%%%%%%%%%%%

1005: \begin{figure*}[t!!!]

1006: \centerline{

1007: \epsfig{file=figure11.eps,width=8cm} \hspace{0.01cm}

1008: \epsfig{file=figure12.eps,width=8cm}}

1009: \caption[]{\label{fjcondi} Transition from the exponential regime

1010:   (a) to the Gaussian regime (b) characterized via the numerical evaluation of

1011:   $c(x_n,\eta)$. The black line corresponds to the analytic results for the

1012:   Gaussian and the exponential PDF, fitted to the PDFs of the

1013:   increments, as it is shown in Fig.\ \ref{fjhisto}. For larger values of $\eta$ the main features of

1014: $c(x_n,\eta)$ for the exponential and the Gaussian case as described in

1015: Sec.\ \ref{Gaussian} and \ref{symexpo} are reproducable. For larger values of $\eta$

1016: we find that if $-\sigma \eta < a_{n,k} -\sigma \eta/2 $ $c(a_n,\eta)$ is either

1017: larger than zero in the asymptotically Gaussian case ($k=144$) or equal to zero in the

1018: asymptotically exponential case ($k=10$).}

1019: \end{figure*}

1020: %%%%%%%%%%%%%%%%%%

1021: Thus the incremental data sets $a_{n,k}$ provides us with the opportunity to test the

1022: results for statistical predictions within Gaussian and exponential

1023: distributed i.i.d.\ random numbers on a data set, which exhibits

1024: correlated structures.

1025:

1026: We are now interested in predicting increments of the acceleration

1027: $a_{n+j,k} - a_{n,k} \geq \eta$  in the incremental data sets  $a_{n,k}=v_{n+k}

1028: -v_n$.

1029: %

1030: In the following we concentrate on the incremental data set $a_{n,10}$, which

1031: has an asymptotically exponential PDF and the data set $a_{n,144}$, which has

1032: an asymptotically Gaussian PDF.

1033: %

1034: Furthermore we focus on increments between

1035:  relatively large time steps, i.e., $j=285$, so that the short-range persistence of the

1036: process does not prevent large events from occuring.

1037: %

1038: As in the previous

1039: sections we are hence exploiting the statistical properties of the time series

1040: to make predictions, rather than the dynamical properties.

1041:

1042: We can now use the evaluation algorithm which was tested on the previous

1043: examples to evaluate the condition for these data sets.

1044: %

1045: The results are shown in Fig.\ \ref{fjcondi}.

1046: %

1047: We find that at least for larger values of $\eta$ the main features of

1048: $c(x_n,\eta)$ for the exponential and the Gaussian case as described in

1049: Sec. \ref{Gaussian} and \ref{symexpo} are also present in the free jet

1050: data.

1051: %

1052: For larger values of $\eta$, $c(a_{n,k},\eta)$ is either

1053: larger than zero in the Gaussian case ($k=144$) or equal to zero in the

1054: exponential case ($k=10$) in the region of interesting precursory variables, i.e., small

1055: values of $a_{n,k}$.

1056: %%%%%%%%%%

1057: \begin{figure*}

1058: \centerline{

1059: \hspace{2cm}

1060: \epsfig{file=figure13.eps,width=6cm, angle=-90}

1061: \epsfig{file=figure14.eps,width=6cm, angle= -90}

1062: }

1063: \caption{\label{fjrocs}Transition from exponential ROC curves (a)  to Gaussian

1064:   ROC curves (b). In the exponential case ($k=10$),

1065: shown in (a) the ROC curves for different event magnitude

1066: $\eta$ are almost the same, although the range of $\eta$ is larger ($\eta \in

1067: (0,6.71)$) than in the Gaussian case shown in (b). For $k=144$ the ROC curves are further

1068: apart, which corresponds to the results for Gaussian ROC curves (see

1069: Sec.\ \ref{Gaussian}) }

1070: \end{figure*}

1071: %%%%%%%%%%%%%%

1072:

1073: However, the presence of the exponential and the Gaussian distributions is more

1074: prominent in the corresponding ROC curves. For the free jet data set, the

1075: predictions were made with an algorithm similar to the one described in

1076: Sec. \ref{pre}. Instead of a specific precursory

1077: structure, which corresponds to the maximum of the likelihood, we use here a

1078: threshold of the likelihood as a precursor. In this setting we give an alarm

1079: for an extreme event, whenever the likelihood that an extreme event follows an

1080: observation is larger than a given threshold value.

1081:

1082: In the exponential case ($k=10$)

1083: shown in Fig.\ \ref{fjrocs}(a) the ROC curves for different event magnitude

1084: $\eta$ almost coincide, although the range of $\eta$ is larger ($\eta \in

1085: (0,6.71)$) than in the Gaussian case shown in Fig.\ \ref{fjrocs} (b). For $k=144$ the ROC curves are further

1086: apart, which corresponds to the results of Secs. \ref{Gaussian} and \ref{symexpo}.

1087:

1088: This example of the free jet data set shows that the specific dependence of

1089: the ROC curve on the event magnitude can also in the case of correlated data

1090: sets be characterized by the PDF of the underlying process.

1091: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

1092: %\newpage

1093: \section{Conclusions \label{conclusions}}

1094: We study the magnitude dependence of the quality of predictions for increments in a

1095: time series which consists in sequences of i.i.d.\ random numbers and in

1096: acceleration increments measured in a free jet flow.

1097: %

1098: Using the first part of

1099: the increment $x_n$ as a

1100: precursory variable we predict large increments $x_{n+1} -x_n$ via statistical

1101: considerations.

1102: %

1103: In order to measure the quality of the predictions we use ROC curves.

1104: %

1105: Furthermore we introduce a quantitative criterion which can determine

1106: whether larger or smaller events are better predictable. This criterion is

1107: tested for time series of Gaussian, exponential and Pareto i.i.d.\ random

1108: variables and for the increments of the acceleration in the free jet flow. The

1109: results obtained from the criterion comply nicely with the corresponding ROC-curves.

1110: Note that for both, the numerical evaluation of the condition and the

1111: ROC-plots, we used only event magnitudes $\eta$ for which we found at least $1000$

1112: events, so that the observed effects are not due to a lack of statistics of

1113: the large events.

1114:

1115: In the sequence of Gaussian i.i.d.\ random numbers, we find that large

1116: increments are better predictable the larger they are. In the Pareto

1117: distributed time series we observe that in slowly decaying power laws larger

1118: events are harder to predict, the larger they are. We find no significant

1119: dependence on the event-magnitude for the sequence of exponentially i.i.d.\ random numbers.

1120:

1121: While the condition can be easily evaluated analytically, it is not that easy

1122: to compute numerically from observed data, since the

1123: calculation implies evaluating the derivatives of numerically obtained

1124: distributions.

1125: %

1126: Using Savitzky-Golay filters improved the results, but

1127: especially in the limit of larger events, where the distributions are

1128: difficult to sample, one cannot trust the results of the numerically

1129: evaluated criterion.

1130: %

1131: However, it is still possible to apply the criterion by

1132: fitting a PDF to the distribution of the underlying process and then evaluate

1133: the criterion analytically.

1134:

1135: Although the magnitude dependence of the quality of predictions was observed in

1136: different contexts and for different measures of predictability, in this

1137: contribution only ROC curves were used. In order to exclude the possibility

1138: that the effect is specific to the ROC curve, future works should also include

1139: other measures of predictability.

1140:

1141: Reviewing the results for the Gaussian case and the slowly decaying power law

1142: from a philosophic point of view one can conclude that nature allows us to

1143: predict large events from the most frequently occuring distribution

1144: easily. However in Gaussian distributions very large events are rare and

1145: therefore less likely to cause damage. Whereas in the less frequently occurring

1146: distributions with heavy power-law tails, large events are especially hard to predict.

1147: %

1148: Therefore one can assume, that rare large impact events of processes with power-law

1149: distributions will remain unpredictable, although their prediction would be highly desirable.

1150:

1151: \acknowledgements

1152: We thank J. Peinke and his group for supplying us with the free jet data.

1153: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

1154: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

1155: \begin{appendix}

1156: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

1157:

1158: \section{Obtaining the analytic expression for the likelihood, the joint and the aposterior PDFs for increments in stochastical processes \label{appA}}

1159: An analytic expression for a filter which selects the PDF of our

1160: extreme increments $x_{n+1} - x_n \geq d$ out of the PDFs of the underlying stochastic process can be

1161: obtained through the  Heaviside function $ \Theta( x_{n+1} - x_{n}-d)$.

1162: %

1163: (Note that $d$ is not scaled by the

1164: standard deviation, i.e., $d =\sigma\eta$.)

1165: %

1166: This filter is then applied to the joint PDF $j(x_0, x_1, ...,x_{n-k+1}, x_{n-k+2},

1167: ..., x_{n})$ of a stochastic process or to be more precise to the likelihood

1168: $L(x_{n+1}|x_0, x_1, ...,x_{n-k+1}, x_{n-k+2}, ..., x_{n})$ that the $n+1$

1169: step follows the previously obtained values. If we condition only on the last

1170: $k$ values, we neglect the dependence on the past.

1171: %

1172: The likelihood that an event Y(d)=1 follows in the $n+1$th step can then be

1173: obtained by multiplication with $\Theta( x_{n+1} - x_{n}-d)$.

1174: \begin{widetext}

1175: \begin{eqnarray}

1176: L(Y_{n+1}(d)=1| \mathbf{x}_{(n-k+1,n)}) & = & \Theta( x_{n+k} - x_{n}-d) L(x_{n+1}|\mathbf{x}_{(k,n)} ),

1177: \end{eqnarray}

1178:

1179: where $\mathbf{x}_{(n-k+1,n)}= (x_{n-k+1},x_{n-k+2}, ...,x_n) $  as defined in Sec.\ \ref{pre}.

1180: \end{widetext}

1181: If the resulting

1182: expression is nonzero, the condition of the extreme event in Eq.\ (\ref{e0}) is

1183: fulfilled and for $x_{n+1}$ and $x_{n}$ the following relation holds:

1184: %

1185: \begin{eqnarray}

1186: x_{n+1} & = & x_{n} + d + \gamma \label{gammadef} \quad (\gamma \in  \mathbb R,

1187: \gamma \geq 0) \quad. \label{gamma0}

1188: \end{eqnarray}

1189: %

1190: Hence it is possible to express the likelihood in terms of $x_{n}$, which is a part of the precursory structure. We can use the integral representation of the Heaviside function with appropriate substitutions to obtain

1191: %

1192: \begin{widetext}

1193: \begin{eqnarray}

1194: L(Y_{n+1}(d)=1|\mathbf{x}_{(n-k+1,n)}) &=& \int_{0}^{\infty} L(x_{n} + d + \gamma|\mathbf{x}_{(n-k+1,n)} )\;d\gamma.\quad\label{int1}

1195: \end{eqnarray}

1196: \end{widetext}

1197: %

1198:

1199: Hence the joint PDF, the aposterior PDF and the total probability to find

1200: increments are given by

1201: \begin{widetext}

1202: \begin{eqnarray}

1203: j\bigl(\mathbf{x}_{(n-k+1,n)},Y_{n+1}(d)=1\bigr) & = & j\bigl(\mathbf{x}_{(n-k+1,n)}\bigr) \cdot L\bigl(Y_{n+1}(d)=1|\mathbf{x}_{(n-k+1,n)}\bigr)\\

1204: \rho\bigl(\mathbf{x}_{(n-k+1,n)}|Y_{n+1}(d)=1\bigr) & = & \frac{j\bigl(\mathbf{x}_{(n-k+1,n)},Y_{n+1}(d)=1\bigr)}{P(Y(d)=1)},\\

1205: P(Y(d)=1) &=& \int_{-\infty}^{\infty} dx_{n-k+1} \int_{-\infty}^{\infty} dx_{n-k+2} ... \int_{-\infty}^{\infty} dx_n \; j(\mathbf{x}_{(0,n-k)}\mathbf{x}_{(n-k+1,n)},Y_{n+1}(d)=1). \label{appendixptotal}

1206: \end{eqnarray}

1207: \end{widetext}

1208:

1209: Whether we can acess a given stochastical process analytically or not depends on the question of whether the integrals in Eq.\ (\ref{appendixptotal}) can be solved or not.

1210:

1211: If we are interested in the prediction of threshold crossings instead of

1212: increments, we can interpret $\eta$ as the magnitude of the threshold and set

1213: $x_n=0$ in order to obtain the corresponding expressions for the likelihood,

1214: the joint PDF, the aposterior PDF, and the total probability.

1215: \end{appendix}

1216: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

1217: \begin{thebibliography}{99}

1218: %1

1219: \bibitem{Jackson} David D. Jackson, {\sl Hypothesis testing and earthquake

1220: prediction}, Proc. Natl. Acad. Sci. U.S.A.\ {\bf 93} 3772 (1996).

1221: %3

1222: \bibitem{Physa} H. Kantz, D. Holstein, M. Ragwitz and N. K. Vitanov, {\sl Markov

1223: chain model for turbulent wind speed data}, Physica {\bf A 342} 315 (2004).

1224: %10

1225: \bibitem{Euromech}

1226: Holger Kantz, Detlef Holstein, Mario Ragwitz and Nikolay K. Vitanov, {Short

1227: time prediction of wind speeds from local measurements}, in: {\sl Wind Energy

1228: -- Proceedings of the EUROMECH Colloquium}, edited by J. Peinke, P. Schaumann and S. Barth, (Springer, New York, 2006).

1229: %

1230: \bibitem{Sarah2} S.\ Hallerberg , E.\ G.\ Altmann, D.\ Holstein and H.\ Kantz, {\sl Precursors of Extreme

1231: Increments}, Phys.\ Rev.\ E {\bf 75}, 016706 (2007)

1232: %

1233: \bibitem{Fatehme} M.Reza Rahimi Tabar, M. Sahimi, F. Ghasemi, K. Kaviani,

1234: M. Allamehzadeh, J. Peinke, M. Mokhtaru and M. Vesaghi,

1235: M. D. Niry, A. Bahraminasab, S. Tabatabai, S. Fayazbakhsh and M. Akbari {\sl

1236: Short-Term Prediction of Medium- and Large-Size Earthquakes Based on Markov

1237: and Extended Self-Similarity Analysis of Seismic Data} arXiv:physics/0510043 v1 6 Oct 2005.

1238: %

1239: \bibitem{Sandpile} A. B. Shapoval and M. G. Shrirman, {\sl How size of target

1240:   avalanches influence prediction efficiency}, International Journal of Modern

1241:   Physics C, {\bf 17}, 1777 (2006).

1242: %

1243: \bibitem{Johnson1} D. Lamper, S.D. Howison and N.F. Johnson, {\sl Predictability

1244: of Large Future Changes in a Competitive Evolving Population},

1245: Phys. Rev. Lett. {\bf 88}, 017902 (2002).

1246: %

1247: \bibitem{Egans} J. P. Egan, {\sc Signal detection theory and ROC analysis},

1248: Academic Press, New York 1975.

1249: %17

1250: \bibitem{Box-Jen} G. E. P. Box, G. M. Jenkins and G. C. Reinsel, {\sc Time Series

1251: Analysis} (Prentice-Hall, Inc., Eaglewood Clif., NJ, 1994).

1252: %8

1253: \bibitem{Brockwell} P.J. Brockwell and R.A. Davis, {\sc Time Series: Theory and

1254: Methods} (Springer, New York, 1998).

1255: %9

1256: \bibitem{likelihood} J. M. Bernado and A. F. M. Smith, {\sc Bayesian Theory} (Wiley, New

1257: York, 1994).

1258: %15

1259: \bibitem{Swets1}D. M. Green and J. A. Swets, {\sl Signal detection theory and

1260: psychophysics.} (Wiley, New York, 1966).

1261: %16

1262: \bibitem{Pepe} M. S. Pepe, {\sc The Statistical Evaluation of Medical Tests for

1263: Classification and Prediction}, (Oxford University Press, New York, 2003).

1264: %

1265: \bibitem{bandi} J. Broecker and L. A. Smith {\sl Scoring Probabilistic

1266:   Forecasts: On the Importance of Being Proper}, Weather and Forecasting {\bf 22}, 382 (2007).

1267: %19

1268: \bibitem {Sornette2} A. Johansen and D. Sornette,

1269: {\sl Stock market crashes are outliers}, European Physical Journal B {\bf 1}, 141 (1998).

1270: %5

1271: \bibitem{stocks}  N. Vandewalle, M. Ausloos, P. Boveroux, et al.,

1272: {\sl How the financial crash of October 1997 could have been predicted},

1273: European Physical Journal B {\bf 4} 139 (1998).

1274: %6

1275: \bibitem{Savgol} A. Savitzky and M. J. E. Golay {\sl Smoothing and

1276:   Differentiation of Data by Simplified Least Squares Procedures}, Analytical

1277:   Chemistry {\bf 36} 1627 (1964).

1278: %

1279: \bibitem{numrecipes} W. H. Press, {\sc Numerical recipes in C} (Cambridge

1280:   University Press, Cambridge, England 1992).

1281: %

1282: \bibitem{Abram} M. Abramowitz, and I. A. Stegun, {\sc Handbook of Mathematical

1283: Functions}, (Dover, New York, 1972).

1284: %

1285: \bibitem{Feller} W. Feller, {\sc An Introduction to Probability Theory and Its

1286:   Applications} (Wiley, New York (1970), Vol. II.

1287: %

1288: \bibitem{Peinke} C. Renner, J. Peinke and R. Friedrich, {\sl Experimental

1289:   indications for Markov properties of small-scale turbulence},

1290:   J. Fluid. Mech. {\bf 433}, 383 (2001).

1291: %

1292: \bibitem{vanAtta} C. W. Van Atta and J. Park, { Statistical self-similarity

1293:   and intertial subrange turbulence}, in {\sl Statistical Models and

1294:   Turbulence}, edited by M. Rosenblatt and C. W. Van Atta, Lect. Notes in Phys., Vol. 12,

1295: (Springer, Berlin, 1972), pp. 402-426.

1296: %

1297: \bibitem{Frisch1990} Y. Gagne, E. Hopfinger and U. Frisch {A new universal

1298:   scaling for fully developed turbulence: the distribution of velocity

1299:   increments} in {\sl New Trends in Nonlinear Dynamics and Pattern-Forming

1300:   Phenomena} edited by P. Coullet and P. Huerre, NATO ASI (Plenum Press, New York 1990),

1301: Vol. 237, pp. 315-319.

1302: \bibitem{Frisch1995} U. Frisch {\sc Turbulence} Cambridge University Press, (Cambridge, England, 1995).

1303: %

1304: \end{thebibliography}

1305: %

1306: \end{document}

1307:

1308: