1: %\documentclass[preprint,superscriptaddress,showpacs,showkeys]{revtex4}
2: \documentclass[preprint,showpacs,showkeys]{revtex4}
3: \usepackage{bm}
4: \usepackage{graphicx}
5: \usepackage{amsmath, amsthm, amsfonts,amssymb}
6: 
7: \begin{document}
8: 
9: \title{Measuring Volatility Clustering in Stock Markets}
10: 
11: \author{Gabjin Oh}
12: \email{gq478051@postech.ac.kr}
13: \affiliation{NCSL, Department of Physics, Pohang University of Science and Technology, Pohang, Gyeongbuk, 790-784, Korea}
14: \affiliation{Asia Pacific Center for Theoretical Physics, Pohang, Gyeongbuk, 790-784, Korea}
15: 
16: 
17: \author{Cheoljun Eom}
18: \email{shunter@pusan.ac.kr}
19: \affiliation{Division of Business Administration, Pusan National University, Busan 609-735, Korea}
20: 
21: \author{Seunghwan Kim}
22: \email{swan@postech.ac.kr}
23: \affiliation{NCSL, Department of Physics, Pohang University of Science and Technology, Pohang, Gyeongbuk, 790-784, Korea}
24: \affiliation{Asia Pacific Center for Theoretical Physics, Pohang, Gyeongbuk, 790-784, Korea}
25: 
26: \author{Taehyuk Kim}
27: \email{tahykim@pusan.ac.kr}
28: \affiliation{Division of Business Administration, Pusan National University, Busan 609-735, Korea}
29: 
30: \begin{abstract}
We propose a novel method to quantify clustering behavior in complex
time series and apply it to high-frequency financial market data. We
find that all data sets examined exhibit volatility clustering,
whereas series from which the volatility clustering effect has been
filtered out with the GARCH model show significantly reduced
clustering. This result confirms that our method can measure the
volatility clustering effect in financial markets.
39: \end{abstract}
40: 
41: \pacs{87.10.+e, 89.20.-a, 87.90.+y}
42: \keywords {econophysics, volatility clustering, GARCH}
43: \maketitle
44: 
45: \section{Introduction}
Financial markets have recently come to be regarded as representative
complex systems, whose properties change dynamically in response to
the inflow of external information and to interactions between
heterogeneous agents [1]. To understand the complexity of financial
markets, interdisciplinary research methods have been developed in
both physics and economics. Various stylized facts have been observed,
such as the long-term memory of volatility [2], volatility clustering
[3], fat tails [4], and multifractality [5, 6]. The most prominent
among these stylized facts are the long-term memory and the clustering
of volatility. In previous studies, clustering behavior has been found
in the return time interval statistics of climate records [7], medical
data, extreme floods [8], and economic data [9].
60: 
In econometrics, various models that incorporate the volatility
clustering effect have been introduced in order to forecast volatility
accurately. The autoregressive conditional heteroskedasticity (ARCH)
model [10] and the generalized autoregressive conditional
heteroskedasticity (GARCH) model [11] are representative examples.
Much research has also been devoted to understanding the
micro-mechanisms of the market. However, studies that quantify the
volatility clustering effect remain insufficient. Quantifying the
volatility clustering effect in financial markets would help us
understand the microscopic behavior of the market. In this paper, we
propose a novel method to quantify the volatility clustering effect in
financial time series.
73: 
We find that all data sets analyzed in this paper exhibit volatility clustering, whereas the degree of volatility clustering is significantly reduced in data from which the volatility clustering effect has been filtered out by the GARCH(1,1) model.
75: 
In the next section, we describe the data sets and methods used in this paper. In Section \ref{sec:RESULTS}, we present our results. Section \ref{sec:CONCLUSIONS} concludes.
77: 
78: \section{DATA and METHODS}
79: 
80: \subsection{DATA}
81: 
We quantitatively investigate volatility clustering using the following market data sets: the 5-minute S\&P 500 index from 1995 to 2004 and 5-minute data for the 28 individual stocks with the largest liquidity traded on the NYSE from 1993 to 2002. The return time series $r(t)$ is calculated as the log-difference of the high-frequency prices, $r(t) = \ln P(t) - \ln P(t-1)$, where $P(t)$ is the stock price at time $t$.
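
As a brief worked illustration (the prices are hypothetical and are not taken from the data sets), suppose $P(t-1) = 100$ and $P(t) = 101$. Then
\begin{displaymath}
r(t) = \ln 101 - \ln 100 = \ln(1.01) \approx 0.00995 ,
\end{displaymath}
which is close to the simple return $[P(t) - P(t-1)]/P(t-1) = 0.01$. This is expected for the small price changes typical of 5-minute data, since $\ln(1+x) \approx x$ for $|x| \ll 1$.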
85: 
86: \subsection{Method to quantify the volatility clustering}
87: 
In this subsection, we propose a novel method to estimate and
quantitatively analyze the volatility clustering effect present in
financial time series. The procedure consists of the following steps.
91: 
\textbf{Step 1} (Symbolization): To quantify the volatility
clustering effect in the financial data, we convert the return time
series $r(t)$ into the symbolic series $S(t)$. The control parameter
is the number of bins, and the symbolic series is defined as

\begin{equation}
S(t) = T_{i} \quad \textrm{if} \quad r(t) \in T_{i}, \qquad T_{i} \in \{T_{1}, T_{2}, \cdots, T_{N_{b}}\},
\end{equation}
where $N_b$ is the number of bins. The symbolization allows the conditional distribution to be calculated with statistical significance.
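
One possible realization of this symbolization, given here only as a sketch, assumes equal-width bins. The bin width $\Delta$ and the bin centers $c_i$ are illustrative quantities that are not specified in the text above. Each bin $T_i$ is then represented by its center,
\begin{displaymath}
c_{i} = \left( i - \frac{N_{b}+1}{2} \right) \Delta , \qquad i = 1, \ldots, N_{b} ,
\end{displaymath}
and the symbol is assigned as
\begin{displaymath}
S(t) = c_{i} \quad \textrm{if} \quad c_{i} - \frac{\Delta}{2} \leq r(t) < c_{i} + \frac{\Delta}{2} .
\end{displaymath}
For example, with $N_b = 5$ and $\Delta = 1$ (in units of the return, or of its standard deviation if the returns are normalized), the centers are $-2, -1, 0, 1, 2$, so that $r(t) = 0.7$ is mapped to $S(t) = 1$ and $r(t) = -1.6$ is mapped to $S(t) = -2$. Under this assumption, returns outside the range covered by the bins are left unsymbolized.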
102: 
\textbf{Step 2} (Calculating the conditional distribution): We estimate the conditional distribution from the symbolized time series generated in Step 1. Here, the conditional distribution is the distribution of the values that immediately follow a specific symbol $S_T$ in the symbolic data. We repeat this calculation of $P(S_j | S_T)$ for every symbol $S_T$ in the proper range. If there is volatility clustering, the conditional distribution of each symbol depends non-trivially on the conditioning value $S_T$.
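
A natural empirical estimator of this conditional distribution, assuming that the ``next value'' means the symbol one time step ahead, is the relative frequency
\begin{displaymath}
P(S_j | S_T) \approx \frac{\#\{ t : S(t) = S_T, \; S(t+1) = S_j \}}{\#\{ t : S(t) = S_T \}} ,
\end{displaymath}
where $\#\{\cdot\}$ denotes the number of time points satisfying the condition and $t$ runs over all points that have a successor. For the hypothetical symbolic sequence $1, 2, 1, 1, 2, 1, 0, 1$, the symbol $S_T = 1$ occurs four times with a successor and is followed by $2, 1, 2, 0$, which gives $P(2|1) = 1/2$, $P(1|1) = 1/4$, and $P(0|1) = 1/4$.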
104: 
\textbf{Step 3} (The average value of the conditional distribution): In Step 3, we calculate the average value of the conditional distribution estimated in Step 2. We consider only the conditional distributions of symbols in the proper range because extreme symbols are rare. We observe the volatility clustering effect through the average value of the conditional distribution, defined as
106: 
107: \begin{equation}
108: \overline {S_{T}} = \frac{1}{N_T} \sum_{j=1}^{N_T}|P(S_j | S_T)|
109: \end{equation}
where $N_T$ is the number of elements of the conditional distribution
$P(S_j | S_T)$ for a specific symbol $S_T$. If the average value does
not depend on the symbolic value $S_T$, there is no volatility
clustering effect, because a time series shows volatility clustering
only when the average value has a positive (negative) relation with
positive (negative) values of $S_{T}$. Next, we calculate the relation
between the specific symbolic values $S_{T}$ and the average values
$\overline{S_{T}}$ of the conditional distribution. In other words, we
determine the degree of volatility clustering (DVC) from the
relationship between $\overline{S_{T}}$ and $S_{T}$. The average value
$\overline{S_{T}}$ is defined as
122: 
123: \begin{displaymath}
124: \overline{S_T} = \left\{
125: \begin{array}{ll}
DVC^{P} \times S_T & S_T \geq 0\\
127: DVC^{N} \times S_T &\textrm{otherwise}\\
128: \end{array}
129: \right.
130: \end{displaymath}
where $DVC^{P}$ and $DVC^{N}$ are the degrees of volatility clustering
for the positive and negative cases, respectively. When $DVC^{P,N} =
0$, there is no clustering effect. When $DVC^{P,N}$ is nonzero, its
magnitude measures the degree of volatility clustering. In this way,
we can quantitatively estimate the volatility clustering effect of a
financial time series.
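
The text above does not state how $DVC^{P}$ and $DVC^{N}$ are obtained from the pairs $(S_T, \overline{S_T})$. One simple possibility, given here as an assumption rather than as the procedure actually used, is a least-squares fit of a line through the origin on each branch:
\begin{displaymath}
DVC^{P} = \frac{\sum_{S_T \geq 0} S_T \, \overline{S_T}}{\sum_{S_T \geq 0} S_T^{2}} , \qquad
DVC^{N} = \frac{\sum_{S_T < 0} S_T \, \overline{S_T}}{\sum_{S_T < 0} S_T^{2}} .
\end{displaymath}
For instance, the hypothetical points $(S_T, \overline{S_T}) = (1, 0.6)$, $(2, 1.1)$, and $(3, 1.7)$ give $DVC^{P} = (0.6 + 2.2 + 5.1)/(1 + 4 + 9) = 7.9/14 \approx 0.56$.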
138: 
139: \section{Results}
140: \label{sec:RESULTS}
141: 
In this section, we present the volatility clustering effect of the financial time series. To verify the usefulness of the proposed method, we employ the GARCH(1,1) model, which captures the volatility clustering effect.
143: 
First, we apply the method to the 5-minute S\&P500 index and calculate the degree of volatility clustering. Fig. 1a shows the return time series of the 5-minute S\&P500 index and Fig. 1b shows its symbolic time series. We then calculate the conditional distribution $P(S|S_T)$ for each specific symbol $S_T$. Fig. 1c shows the conditional distributions of specific symbols. In Fig. 1c, we find that the width of the conditional distribution increases as the value of the symbolic data increases. In other words, the conditional distribution of a small symbolic value is narrower than that of a large symbolic value. To observe the relationship between specific symbolic values and their conditional distributions, we calculate the average value of the conditional distribution for each specific symbol $S_T$. Fig. 1d shows the relationship between the specific symbol and the average value. Circles and squares in Fig. 1d indicate the original and the shuffled (surrogate) time series, respectively. We find that the average values of the conditional distribution for the original time series are positively related to the magnitude of the specific symbolic value, with $DVC_{S\&P500}^{P}=0.57$ and $DVC_{S\&P500}^{N}=0.52$, while those for the shuffled time series do not depend on the symbolic value $S_{T}$. The return time series of the S\&P500 index thus shows the volatility clustering effect: large (small) values tend to be followed by large (small) values.
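
The role of the shuffled series can be made explicit with a short sketch. The permutation $\pi$ and the marginal distribution $P(S_j)$ are notation introduced here for illustration. Shuffling replaces $r(t)$ by $r(\pi(t))$, where $\pi$ is a random permutation of the time indices, which preserves the distribution of returns but destroys their temporal order. For a sufficiently long shuffled series, consecutive symbols are therefore approximately independent,
\begin{displaymath}
P(S_j | S_T) \approx P(S_j) \quad \textrm{for all } S_T ,
\end{displaymath}
so that any average taken over the conditional distribution no longer depends on $S_T$, and the fitted $DVC^{P,N}$ should be close to zero.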
145: 
Next, we use the GARCH model to verify the usefulness of our method.
The GARCH model generates the volatility clustering effect. We create
a new time series in which the volatility clustering effect has been
removed by GARCH(1,1) filtering and estimate its degree of volatility
clustering. Fig. 2 displays the degree of volatility clustering for
the 28 individual stocks with the largest liquidity traded on the
NYSE. The circles (red), the diamonds (blue), the squares (green), and
the triangles (pink) indicate the degree of volatility clustering for
the positive and negative return time series of the original and the
GARCH(1,1)-filtered data, respectively. In Fig. 2, we find that all
the stock return time series, regardless of the individual stock,
exhibit the volatility clustering effect, with $0.38 \leq DVC^P \leq
0.72$ and $-0.69 \leq DVC^N \leq -0.32$. However, after the volatility
clustering behavior is eliminated by the GARCH(1,1) model, the degree
of volatility clustering is significantly reduced. This supports the
validity of our method for quantifying the volatility clustering
effect in financial time series.
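
For reference, the standard GARCH(1,1) model [11], written here with a zero conditional mean, is
\begin{displaymath}
r(t) = \sigma(t) \, \epsilon(t) , \qquad \sigma^{2}(t) = \omega + \alpha \, r^{2}(t-1) + \beta \, \sigma^{2}(t-1) ,
\end{displaymath}
where $\epsilon(t)$ is independent, identically distributed noise with zero mean and unit variance, and $\omega > 0$, $\alpha \geq 0$, $\beta \geq 0$. The filtering procedure is not specified in detail above. A common choice, assumed here only for illustration, is to fit the parameters to the data and to use the standardized residuals
\begin{displaymath}
\tilde{r}(t) = \frac{r(t)}{\hat{\sigma}(t)}
\end{displaymath}
as the filtered series, where $\hat{\sigma}(t)$ is the fitted conditional standard deviation. If the model captures the clustering, $\tilde{r}(t)$ should be approximately free of volatility clustering.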
165: 
166: \section{Conclusions}
167: \label{sec:CONCLUSIONS}
168: 
We proposed a novel method to quantify volatility clustering in
financial time series and calculated the degree of volatility
clustering (DVC) for diverse stock price data. We found that all the
financial data analyzed exhibited volatility clustering, whereas the
degree of volatility clustering was significantly reduced in data from
which the volatility clustering effect had been filtered out by the
GARCH(1,1) model. This result confirms that our method measures the
volatility clustering effect in financial time series well. Our method
might be applied to the detailed clustering analysis of diverse
complex signals, including climate and heart rate variability (HRV)
data as well as financial time series. Further studies will examine
these systems more extensively.
182: 
183: 
184: This work was supported by the Korea Research Foundation funded by
185: the Korean Government (MOEHRD) (KRF-2005-042-B00075), and the
186: MOST/KOSEF to the National Core Research Center for Systems
187: Bio-Dynamics (R15-2004-033), and by the Ministry of Science \&
188: Technology through the National Research Laboratory Project, and
189: by the Ministry of Education through the program BK 21.
190: 
191: 
192: 
193: \begin{thebibliography}{00}
194: 
195: \bibitem{1}
R. N. Mantegna and H. E. Stanley, An Introduction to Econophysics:
Correlations and Complexity in Finance (Cambridge University Press,
Cambridge, U.K., 1999); J.-P. Bouchaud and M. Potters, Theory of
Financial Risk and Derivative Pricing: From Statistical Physics to
Risk Management (Cambridge University Press, Cambridge, USA,
2004).
202: 
203: \bibitem{2}
204: G. Oh \textit{et al.}, J. Korean Phys. Soc. \textbf{48}, 197 (2006);
205: Y. Liu \textit{et al.}, Phys. Rev. E \textbf{60}, 1390 (1999);
206: T. Di Matteo, Journal of Banking \& Finance \textbf{29}, 827 (2005);
A. W. Lo, Econometrica \textbf{59}, 1279 (1991).
208: 
209: \bibitem{3}
210: I. Giardina \textit{et al.}, Physica A \textbf{324}, 6 (2003);
211: B. Jacobsen, Journal of Empirical Finance \textbf{10}, 479 (2003).
212: 
213: \bibitem{4}
214: R. N. Mantegna \textit{et al.}, Nature (London) \textbf{376}, 46 (1995);
215: R. N. Mantegna \textit{et al.}, Nature (London) \textbf{383}, 587 (1996);
216: V. Plerou \textit{et al.}, Nature (London) \textbf{421}, 130 (2003);
217: X. Gabaix \textit{et al.}, Nature (London) \textbf{423}, 267 (2003).
218: 
219: \bibitem{Sauer}
220: J. F. Muzy \textit{et al.}, Eur. Phys. J. B \textbf{17}, 537 (2000);
221: Z. Eisler \textit{et al.}, Physica A \textbf{434}, 603 (2004);
L. Calvet \textit{et al.}, J. Econometrics \textbf{105}, 27 (2001);
P. C. Ivanov \textit{et al.}, Nature (London) \textbf{399}, 461 (1999);
R. B. Govindan and H. Kantz, Europhys. Lett. \textbf{68}, 184 (2004);
225: Y. Ashkenazy \textit{et al.}, Phys. Rev. Lett. \textbf{86}, 1900 (2001).
226: 
227: \bibitem{Jacobsen}
228: J. F. Muzy \textit{et al.}, Int. J. Bifurcation Chaos Appl. Sci. Eng. \textbf{4}, 245 (1994).
229: 
230: \bibitem{Hiemstra}
231: A. Bunde \textit{et al.}, Phys. Rev. Lett. \textbf{94}, 048701 (2005).
232: 
233: \bibitem{Willinger}
234: M. Mudelsee \textit{et al.}, Nature (London) \textbf{425}, 166 (2003).
235: 
236: \bibitem{Grau}
237: K. Yamasaki \textit{et al.}, Proc. Natl. Acad. Sci. \textbf{102}, 9424 (2005);
238: F. Wang \textit{et al.}, Phys. Rev. E \textbf{73}, 066128 (2006).
239: 
240: \bibitem{Gabjin1}
241: R. F. Engle, Econometrica \textbf{50}, 987 (1982).
242: 
243: \bibitem{Cajueiro}
244: T. Bollerslev, J. Econometrics \textbf{31}, 307 (1986).
245: 
246: 
247: 
248: \newpage
249: 
250: \begin{figure}
251: 
252: \includegraphics[width=1.0\textwidth]{OHFig1.eps}
253: 
\caption[0]{(a) The return time series of the 5-minute S\&P500
index for 9 years from 1995 to 2004. (b) Its symbolic time
series. (c) The conditional distributions of specific symbolic
values. Each marker indicates a specific symbolic value. (d) The
degree of volatility clustering of the S\&P500 index. The
circles and the squares indicate the shuffled and the original
time series, respectively. }
261: \end{figure}
262: 
263: 
264: \begin{figure}
265: 
266: \includegraphics[width=1.0\textwidth]{OHFig1.eps}
\caption[0]{The degree of volatility clustering for the 28
individual stocks traded on the NYSE from 1993 to 2002 and for their
GARCH-filtered data. The circles (red), the diamonds (blue), the
squares (green), and the triangles (pink) indicate the positive
and the negative data of the original and the GARCH(1,1)-filtered
data, respectively.}
273: \end{figure}
274: 
275: 
276: %\begin{figure}[tb]
277: 
278: %\includegraphics[height=10cm, width=16cm]{F3.eps}
279: 
280: %\caption[0]{The degree of volatility clustering of the 28
281: %individual stocks and traded on the NYSE from 1993 to 2002 and its
282: %surrogate data eliminated the non-linearity by phase
283: %randomization. The circles (red), diamonds (blue), Squares
284: %(green), and triangles (pink) indicates the positive and negative
285: %data of original and surrogate data, respectively.}
286: %\end{figure}
287: 
288: 
289: 
290: 
291: \end{thebibliography}
292: 
293: \end{document}
294: