1: \documentclass[twocolumn,showpacs,preprintnumbers,amsmath,amssymb]{revtex4}
2:
3: \bibliographystyle{/mnt/p0friedr/tdfrank/styles/apsrev}
4: \usepackage{epsfig}
5:
6: \begin{document}
7: \title{An Iterative Procedure for the Estimation of Drift and
8: Diffusion Coefficients of Langevin Processes}
9: \author{D.~Kleinhans, R.~Friedrich}
10: \affiliation{Institute for Theoretical Physics, University of
11: M\"unster, D-48149 M\"unster, Germany}
12:
13: \author{A.~Nawroth, J.~Peinke}
14: \affiliation{ Institute for Physics, Carl-von-Ossietzky University Oldenburg, D-26111 Oldenburg, Germany}
15: \date{\today}
16:
17:
18: \begin{abstract}
19: A general method is proposed which allows one
20: to estimate drift and diffusion coefficients of a stochastic process
21: governed by a Langevin equation. It extends a previously devised
22: approach [R. Friedrich et al.,
23: Physics Letters {\bf A 271}, 217 (2000)], which
24: requires sufficiently high sampling rates.
25: The analysis is based on an iterative procedure minimizing
26: the Kullback-Leibler distance between measured and estimated
27: two time joint probability distributions of the process.
28:
29: \end{abstract}
30:
31: \pacs{87.23.Cc,02.50.Ey,05.40.Jc}
32: \maketitle
33:
34:
35: \section{Introduction}
36: Complex behavior in systems far from equilibrium can quite often
37: be traced back to rather simple laws due to the existence
38: of processes of selforganization \cite{Haken1}.
39: Since complex systems are composed
40: of a huge number of subsystems, however, fluctuations stemming from
41: the microscopic degrees of freedom play an important role
42: introducing a temporal variation on a fast time scale which quite
43: often can be considered as fluctuations.
44: The consequence is the existence of evolution equations of a set of
45: macroscopic order parameters ${\bf q}(t)$ which are governed by nonlinear
46: Langevin equations \cite{Risken}, \cite{Gardiner}:
47: \begin{equation}\label{Lange}
48: \frac{d}{dt}q_{i} = D_i^1({\bf q}) + \sum_l g_{il}({\bf q}) F_l(t)\quad ,
49: \end{equation}
50: where ${\bf q}(t)$ denotes the n-dimensional state vector, ${\bf
51: D}^1({\bf q})$ is the drift vector and the matrix $g({\bf q})$
52: is related to the diffusion matrix according to
53: $\left(D^2({\bf q})\right)_{ij}
54: =\sum_k g_{ik}({\bf q}) g_{jk}({\bf q})$. ${\bf F}(t)$ are fluctuating forces
55: with Gaussian statistics
56: delta-correlated in time: $<F_l(t)>=0$,
57: $<F_l(t) F_k(t')>=2\delta_{lk}\delta(t-t')$.
58: Here and in
59: the following we adopt It\^o's interpretation of stochastic
60: integrals \cite{Risken}, \cite{Gardiner}.
61:
62: Analyzing complex systems, which can be described by
63: stochastic equations of the form (\ref{Lange}), therefore, amounts to
64: assess the underlying Langevin equations or the corresponding
65: Fokker-Planck equations from an inspection of experimentally
66: determined time series \cite{Haken2}.
67: Recently, an operational method \cite{Siegert1}, \cite{Siegert2}
68: has been devised, which allows one to
69: estimate drift and diffusion coefficients of the
70: stochastic processes from experimental data.
71: This method has been successfully applied to various problems in the
72: field of complex systems like the analysis of noisy electrical circuits
73: \cite{Siegert2}, stochastic dynamics of metal cutting
74: \cite{Grad1}, systems with feedback delay \cite{Frank1},
75: meteorological processes like wind-driven Southern Ocean variability
76: \cite{Sura1}, traffic flow data \cite{Kriso} and physiological time series \cite{Kuusela04}.
77: Furthermore it has been applied
78: to problems like turbulent flows \cite{PRL}, \cite{JFM},
79: passive scalar advection \cite{Tutku},
80: financial time series \cite{PRLfinanz}, analysis of rough surfaces
81: \cite{Jafari}, \cite{Waechter}, which can be characterized as a
82: stochastic process with respect to a scale variable exhibiting
83: markovian properties in scale.
84:
85: The method is based on the evaluation of the time limits
86: the first and second conditional moments,
87: \begin{subequations}
88: \label{est}
89: \begin{eqnarray}
90: {\bf D}^1({\bf q}) &=& \lim_{\tau \rightarrow 0} \frac{1}{\tau}
91: < {\bf q}(t+\tau)-{\bf q}(t)|{\bf q}(t)={\bf q}> \\
92: { D}^2_{ij}({\bf q}) &=& \lim_{\tau \rightarrow 0} \frac{1}{2\tau}
93: < [{\bf q}(t+\tau)-{\bf q}(t)]_{i}\nonumber\\
94: &&[{\bf q}(t+\tau)-{\bf q}(t)]_{j}|{\bf q}(t)={\bf q}>\quad .
95: \end{eqnarray}
96: \end{subequations}
97: From these expressions it becomes evident that the sampling rate in the
98: experiments has to be sufficiently high in order to allow for a
99: reliable evaluation of the limit $\tau \rightarrow 0$.
100: Therefore, in all
101: applications mentioned above
102: the results have been checked in a selfconsistent manner by a
103: recalculation of conditional pdf's from the estimated Fokker-Planck
104: equation. Possible problems in estimating drift and diffusion coefficients
105: related with low sampling frequencies have been adressed by Sura
106: \cite{Sura}, Ragwitz and Kantz \cite{Ragw}, \cite{Kantzcom} and
107: Friedrich et al. \cite{Kantzrepl}.
108:
109: The aim of the present letter is to devise an extension of the above
110: method in order to overcome problems related with the time limit $\tau
111: \rightarrow 0$. These problems immediately show up for low
112: sampling rates.
113: We also want to point out that for the case of stochastic forces
114: ${\bf F}(t)$ with small but finite temporal correlations the process is not markovian in the
115: limit $\tau \to 0$. In this case, however, one should use the Stratonovich
116: interpretation of stochastic processes \cite{Risken}.
117:
118: % We also want to point out that for the case of stochastic forces
119: % $\bf{F}(\bf{q},t)$ with small but finite temporal correlations the
120: % limit $\tau \to 0$ cannot be used for approximating the process by a
121: % markovian one. In this case, however, one should use the Stratonovich
122: % interpretation of stochastic processes \cite{Risken}.
123:
124: % or for the case of stochastic forces ${\bf F}({\bf q},t)$
125: % with small but finite temporal correlations.
126: % Furthermore, uncorrelated noise
127: % sources, so-called measurement noise \cite{Siefert} additionally may limit the
128: % accuracy of the estimates (\ref{est}).
129:
130: \section{Description of the Method}
131: The starting point is a first estimate of drift and
132: diffusion coefficients by the expressions (\ref{est}) evaluated for
133: the smallest reliably possible values of $\tau$. The second step
134: is an embedding of drift and diffusion coefficients into a family of
135: functions ${\bf D}^1({\bf q},\sigma)$, ${\bf D}^2({\bf q},\sigma)$
136: parameterized by a set of free parameters $\sigma$. The expressions
137: obtained in the first step
138: already yield a crude estimate of the parameters $\sigma$.
139: The third step consists in optimizing the free parameters
140: ${\sigma}$.
141:
142: Optimization of the free parameters can be performed
143: in the following way. One determines the
144: conditional probability distribution
145: \begin{equation}
146: p({\bf q},t|{\bf q}_0,t_0;{\bf\sigma})
147: \end{equation}
148: for the parameter set ${\sigma}$ either by a
149: simulation of the Langevin equations or by a numerical
150: solution of the corresponding Fokker-Planck equation. In each case,
151: one can determine the two point pdf $f({\bf q},t;{\bf
152: q}_0,t_0;{\sigma})=p({\bf q},t|{\bf q}_0,t_0;{\sigma})f({\bf
153: q}_0,t_0)$ .
154: The reader should note that this
155: may be done for various finite values of $t-t_0$. The obtained two time
156: pdf can now be compared with the experimental one. A suitable measure
157: for the distance is the Kullback-Leibler information \cite{Haken2}
158: defined according to
159: \begin{eqnarray}
160: \label{kullb_information}
161: K({\sigma},t,t_0) &=&\int d{\bf q} \int d{\bf q}_0
162: f_{exp}({\bf q},t;{\bf q}_0,t_0)
163: \nonumber \\
164: &\times &
165: \ln \frac{f_{exp}({\bf q},t;{\bf q}_0,t_0)}{f({\bf q},t;{\bf q}_0,t_0,{\sigma)}}\qquad .
166: \end{eqnarray}
167:
168: The minimum of the Kullback-Leibler information with respect to the parameters
169: ${\sigma}$ yields estimates of drift and diffusion of
170: a stochastic process. This process is the best approximation
171: with respect to this measure in
172: the class of stochastic processes characterized by the parameters
173: ${\sigma}$. The problem of identifying a stochastic process is then
174: equivalent to determining a minimum of the Kullback information. In practice
175: the minimum can be determined by gradient or genetic
176: algorithms and solved by standard methods \cite{weinstein90}.
177: In the following we shall consider cases, where it
178: is possible to obtain a parametrization of the stochastic processes by
179: only few parameters $\sigma$ such that the Kullback-Leibler measure
180: can be investigated by graphical means.
181:
182: \section{Examples}
183: For certain classes of stochastic processes
184: the above procedure can be reduced considerably by the fact that
185: only few free parameters for the parametrization
186: of drift and diffusion terms have to be introduced. As a consequence
187: the minimization procedure of the Kullback-Leibler information
188: is greatly facilitated.
189:
190: \subsection{One dimensional systems}
191:
192: \begin{figure}
193: \begin{center}
194: \includegraphics[width=8.6cm]{mult01u2.dat.bw.eps}
195: \end{center}
196: \caption{Segment of the one-dimensional synthetic time series I. }
197: \label{mult01u2.dat}
198: \end{figure}
199:
200: The case of one-dimensional systems allows for the following
201: treatment due to the fact that the
202: stationary pdf, which is assumed to exist, can be determined
203: analytically:
204: \begin{equation}
205: f(q)=\frac{N}{D^{2}(q)} e^{\ \int\limits^q dq' \frac{D^{1}(q')}{D^{2}(q')}} \qquad .
206: \end{equation}
207: As a consequence, we have the relationship
208: \begin{equation}
209: \label{multnoise}
210: D^{1}(q)=D^{2}(q)\frac{d}{dq} \ln f(q)+\frac{d}{dq}D^{2}(q) \qquad .
211: \end{equation}
212:
213: Since $f(q)$ can be determined from the time series
214: an estimate in terms of a parameterized ansatz
215: for the diffusion term suffices. In fact, one may use the ansatz
216: $D^2(q)=Q+ aq^2 +b q^4+\ldots$ , which
217: helps in lowering the number of parameters $\sigma$ to be estimated by
218: the above procedure of minimization the Kullback-Leibler information.
219: The drift then follows from (\ref{multnoise}).
220:
221: \begin{figure}
222: \begin{center}
223: \includegraphics[width=8.6cm]{mult02.out.map.bw.eps}
224: \end{center}
225: \caption{Kullback distance $K(Q,a)$ as function of the parameters $Q$
226: and $a$ for time series I. The lines are equidistant
227: contour lines starting from $2.6\cdot 10^{-4}$ in the center.
228: The distance between contour lines is $5\cdot 10^{-5}$.
229: A clear minimum is located at $(Q,a)=(1,1)$.}
230: \label{mult02.out}
231: \end{figure}
232:
233: Let us consider system I with drift and diffusion functions
234: \begin{eqnarray}
235: D^1(q)=q-q^3\quad\mbox{and}\quad D^2(q)=1+q^2
236: \end{eqnarray}
237: driven by a multiplicative noise term.
238: We use synthetic data obtained by numerical integration of the
239: corresponding Langevin equation \cite{Risken},
240: \begin{equation}
241: q(t+\tilde{\tau})=q(t)+\tilde{\tau}D^1\left[q(t)\right]+\sqrt{\tilde{\tau}}D^2\left[q(t)\right]\Gamma(t)\quad.
242: \end{equation}
243: A time series containing $10^6$ points with time
244: increment $10^{-2}$ was generated. The intrinsic increment $\tilde{\tau}$ used for numerical integration
245: of the corresponding Langevin equation was $10^{-5}$.
246: A time segment of the data is presented in fig.~\ref{mult01u2.dat}.
247: Since the stochastic process is stationary and ergodic
248: all statistical quantities can be retrieved from this data.
249:
250: For the estimation of the pdf's from data state space has to be
251: divided into bins. We used $100$ equidistant
252: bins for the stationary pdf. A very accurate way to calculate the
253: integral yielding the Kullback-Leibler distance
254: without running out of memory even for higher dimensional
255: data is to use an adequate local grid for the first argument (the
256: destination) of the conditional pdf's. The conditional pdf then
257: locally can be retrieved from the data for any $({\bf q},{\bf q_{0}})$
258: with high accuracy. The local grid used in this example covered $20$
259: equidistant bins.
260:
261: % ...select a different
262: % amount of bins for the conditional pdf's. The conditional pdf then
263: % locally can be retrieved from the data for any ${\bf q}$
264: % with high accuracy.
265:
266: During the iteration
267: procedure the two point pdf's have to be calculated.
268: We again use the numerical simulation of Langevin processes
269: as a very efficient way to generate these pdf's.
270:
271: Starting from the estimates (\ref{est}) the ansatz $D^2(Q,a,q)=Q+aq^2$
272: is reasonable. The drift immediately follows from (\ref{multnoise})
273: and, for each parameter set $(Q,a)$,
274: one obtains a stationary distribution that equals the experimental one.
275: Due to this fact the evaluation of the conditional
276: pdf $p(q,t+\tau|q_{0},t;Q,a)$ suffices to calculate the Kullback-Leibler
277: distance. A clear minimum of the distance is found at $(Q,a)=(1,1)$
278: corresponding to the original set of parameters.
279: The Kullback distance close to this minimum
280: in the two-dimensional parameter space is
281: exhibited in fig.~\ref{mult02.out}.
282:
283: \subsection{Application to potential systems}
284:
285: \begin{figure}
286: \begin{center}
287: \includegraphics[width=8.6cm]{feb002.dat.1d.eps}
288: \end{center}
289: \caption{Segment of the two-dimensional synthetic time series II.}
290: \label{feb002.dat.1d}
291: \end{figure}
292:
293: \begin{figure}
294: \begin{center}
295: \includegraphics[width=8.6cm]{feb002.out.eps}
296: \end{center}
297: \caption{The Kullback distance $K(Q)$ as a function of the
298: noise strength $Q$ (time series II). A minimum is clearly visible at the value
299: $Q=0.05$.}
300: \label{feb002.out}
301: \end{figure}
302:
303: The procedure for one-dimensional systems can be immediately
304: extended to higher dimensions if one restricts the analysis
305: to the so-called class
306: of potential systems for which the drift vector
307: ${\bf D}^1({\bf q})$ is obtained from a potential
308: $V({\bf q})$ and $g_{ik}=\sqrt{Q}\delta_{ik}$.
309: The central point of our analysis is the following exact expression for
310: the stationary pdf
311:
312: \begin{equation}
313: f({\bf q})=N e^{-V({\bf q}) /Q}
314: \qquad .
315: \end{equation}
316:
317: Since the stationary pdf can be estimated from experimental data
318: one may parameterize the class of stochastic
319: processes by the single variable $Q$. Thus the drift function can be taken
320: to be fixed except for the value $Q$:
321:
322: \begin{equation}
323: {\bf D}^{1}({\bf q})= Q {\bf \nabla} \ln f({\bf q}) \qquad . \label{add_final}
324: \end{equation}
325:
326: As an example we consider the two-dimensional system
327: \begin{equation}
328: {\bf D^{1}}({\bf q})=
329: \left(\begin{array}{c}\epsilon q_{1}-q_{1}\left[q_{1}^2+Bq_{2}^2\right]\\
330: \epsilon q_{2}-q_{2}\left[Bq_{1}^2+q_{2}^2\right]\end{array}\right)
331: \qquad .
332: \end{equation}
333: This dynamical system arises as order parameter equations for instabilities
334: in nonequilibrium systems and has applications
335: from the fields of pattern formation in nonequilibrium systems to pattern
336: recognition \cite{Haken1}. It exhibits the features of
337: multistability and selection. We considered the case
338: $\epsilon=0.25$ and $B=2$ (time series II).
339: These parameters yield four stable fixpoints of the dynamics
340: on the axes at $|{\bf q}|=1/2$ and unstable fixpoints at the
341: origin and on the bisectional lines at $|{\bf q}|=\sqrt{6}/6$.
342:
343: Data with time increments $10^{-1}$
344: for the datapoints
345: has been generated with a time step
346: $10^{-5}$ for the integration of the Langevin equations.
347: The simulated time series II with $Q=.05$ consists
348: of $5\cdot 10^{6}$ data points.
349: Figure~\ref{feb002.dat.1d} exhibits a segment of the generated data.
350:
351: We analyzed the time series as outlined above.
352: State space in this case is divided in $100\times 100$ equidistant bins.
353: Since the drift ${\bf D}^{1}({\bf q})$ can be evaluated
354: from (\ref{add_final}) all parameters are fixed except for the
355: noise strength $Q$.
356:
357: After evaluating the Kullback measure for various values of
358: $Q$ this value has to be optimized. The optimal value is
359: determined by the minimum of the
360: Kullback distance. For the present case the minimum can easily be
361: determined by graphical means.
362:
363: \begin{figure}
364: \begin{center}
365: \includegraphics[width=8.6cm]{feb002.drift.eps}
366: \end{center}
367: \caption{Time series II: Drift vector field extracted from data
368: using the optimal value of $Q$. Unstable fixpoints in the center and
369: on the bisectional line as well as
370: the attractive fixpoints are clearly visible.}
371: \label{feb002.drift}
372: \end{figure}
373:
374: Fig.~\ref{feb002.out} shows the Kullback distance
375: $K(Q)$ as a function of the noise strength Q for the time series II.
376: The minimum is clearly visible at $Q=0.05$ and agrees with the one
377: used for simulation. With this parameter the drift
378: vector field can be recalculated
379: from the stationary distribution
380: based on relation
381: (\ref{add_final}).
382: The resulting drift vector field of dataset II is exhibited
383: in fig.~\ref{feb002.drift}.
384:
385: \section{Conclusion}
386:
387: Summarizing, we have outlined an operational method for the estimation
388: of drift and diffusion terms from experimental time series of
389: stochastic Langevin processes. In contrast to previous approaches the
390: present algorithm does not rely on estimating conditional moments in
391: the small time increment limit. Although this limit
392: yields a first
393: approximation an iterative refinement of the estimated stochastic
394: process is performed by minimization of the Kullback-Leibler distance between
395: estimated and measured two time probability distributions.
396: The proposed procedure solves the problem of estimating drift and
397: diffusion terms of Langevin processes from time series.
398: It involves the numerical solution of Langevin equations with
399: parameter dependent drift and diffusion terms, an evaluation of
400: the Kullback-Leibler integral (which may be
401: determined by means of a Monte-Carlo method) and an
402: optimization procedure, for which standard approaches
403: can be used. All involved steps are based on routine calculations.
404: Furthermore, restriction to certain classes of
405: stochastic processes like potential systems can drastically lower
406: the numerical efforts of the procedure. Therefore, the proposed
407: algorithm can be applied also to systems with higher dimensional
408: state spaces.
409:
410: \begin{thebibliography}{}
411: \bibitem{Haken1} H. Haken, {\em Synergetics: Introduction and Advanced
412: Topics}, Springer Verlag Berlin Heidelberg New York (2004)
413: \bibitem{Risken} H. Risken, {\em The Fokker-Planck equation},
414: Springer-Verlag Berlin Heidelberg New-York Tokyo (1983)
415: \bibitem{Gardiner} C. W. Gardiner, {\em Handbook of Stochastic
416: Methods}, Springer-Verlag Berlin Heidelberg New-York Tokyo (1983)
417: \bibitem{Haken2} H. Haken, {\em Information and Self-Organization-
418: A macroscopic approach to complex systems},
419: Springer Verlag Berlin Heidelberg New York (2004)
420: \bibitem{Siegert1} S. Siegert, R. Friedrich, J. Peinke,
421: Phys. Lett. {\bf A 234},
422: 275-280 (1998)
423: \bibitem{Siegert2} R. Friedrich, S. Siegert, J. Peinke, St. L\"uck,
424: M. Siefert, M. Lindemann, J. Raethjen, G. Deuschl, G. Pfister,
425: Phys. Lett.
426: {\bf A 271}, 217 (2000)
427: \bibitem{Grad1} J. Gradisek, I. Grabec, S. Siegert, R. Friedrich,
428: Mechanical Systems and Signal Processing {\bf 16}
429: (5), 831 (2002)
430: \bibitem{Frank1} T. D. Frank, P. J. Beek, R. Friedrich, Phys. Lett. {\bf A
431: 328}, 219 (2004), T. D. Frank, R. Friedrich, P.J. Beek, Stochastics and
432: Dynamics {\bf 9}, 44 (2004)
433: \bibitem{Sura1} P. Sura, S.T. Gille, Journal of
434: Marine Research {\bf 61}, 313 (2003)
435: \bibitem{Sura2} P. Sura, Journal of the Atmospheric Sciences
436: {\bf 60}, 654 (2003)
437: \bibitem{Kriso} S. Kriso, R. Friedrich, J. Peinke, P. Wagner,
438: Phys. Lett. {\bf A
439: 299}, 287 (2002)
440: \bibitem{Kuusela04} T. Kuusela, Phys. Rev. {\bf E 69}, 031916 (2004)
441: \bibitem{PRL} R. Friedrich, J. Peinke,
442: Phys. Rev. Lett. {\bf 78}, 863 (1997)
443: \bibitem{JFM} Ch. Renner, J. Peinke, R. Friedrich,
444: J. Fluid Mech. {\bf 433}, 383 (2001)
445: \bibitem{Tutku} M. Tutkun, L. Mydlarski, New Journal
446: of Physics {\bf 6}, Art. No. 49 (2004)
447: \bibitem{PRLfinanz} R. Friedrich, J. Peinke, Ch. Renner,
448: Phys. Rev. Lett. {\bf 84}, 5224 (2000)
449: \bibitem{Jafari} G. R. Jafari, S. M. Fazeli, F. Ghasemi,
450: S. M. V. Allaei,
451: M. R. R. Tabar, A. I. Zad, G. Kavei, Phys. Rev. Lett. {\bf 91},
452: 226101 (2003)
453: \bibitem{Waechter} M. W\"achter, F. Riess, H. Kantz, J. Peinke,
454: Europhys. Lett. {\bf 64}, 579 (2003)
455: \bibitem{Sura} P. Sura, j. Barsugli, Phys. Lett. {\bf A 305},
456: 304 (2002)
457: \bibitem{Ragw} M. Ragwitz, H. Kantz, Phys. Rev. Lett.
458: {\bf 87}, 254501 (2001)
459: \bibitem{Kantzcom} R. Friedrich, Ch. Renner, M. Siefert, J. Peinke,
460: Phys. Rev. Lett. 89, 149401 (2002)
461: \bibitem{Kantzrepl} M. Ragwitz, H. Kantz,
462: Phys. Rev. Lett. 89, 149402 (2002)
463: \bibitem{Siefert} M. Siefert, A. Kittel, R. Friedrich, J. Peinke,
464: Europhys. Lett. {\bf 61} (4), 466 (2003)
465: \bibitem{weinstein90}
466: E. Weinstein, M. Feder, and A.~V. Oppenheim, IEEE Transactions on Acoustics,
467: Speech and Signal Processing {\bf 38}, 1652 (1990)
468: \end{thebibliography}
469:
470: \end{document}
471: