0709:0709.2416/Oh.tex

1: %\documentclass[preprint,superscriptaddress,showpacs,showkeys]{revtex4}

2: \documentclass[preprint,showpacs,showkeys]{revtex4}

3: \usepackage{bm}

4: \usepackage{graphicx}

5: \usepackage{amsmath, amsthm, amsfonts,amssymb}

6:

7: \begin{document}

8:

9: \title{Measuring Volatility Clustering in Stock Markets}

10:

11: \author{Gabjin Oh}

12: \email{gq478051@postech.ac.kr}

13: \affiliation{NCSL, Department of Physics, Pohang University of Science and Technology, Pohang, Gyeongbuk, 790-784, Korea}

14: \affiliation{Asia Pacific Center for Theoretical Physics, Pohang, Gyeongbuk, 790-784, Korea}

15:

16:

17: \author{Cheoljun Eom}

18: \email{shunter@pusan.ac.kr}

19: \affiliation{Division of Business Administration, Pusan National University, Busan 609-735, Korea}

20:

21: \author{Seunghwan Kim}

22: \email{swan@postech.ac.kr}

23: \affiliation{NCSL, Department of Physics, Pohang University of Science and Technology, Pohang, Gyeongbuk, 790-784, Korea}

24: \affiliation{Asia Pacific Center for Theoretical Physics, Pohang, Gyeongbuk, 790-784, Korea}

25:

26: \author{Taehyuk Kim}

27: \email{tahykim@pusan.ac.kr}

28: \affiliation{Division of Business Administration, Pusan National University, Busan 609-735, Korea}

29:

30: \begin{abstract}

31: We propose a novel method to quantify the clustering behavior in a

32: complex time series and apply it to a high-frequency data of the

33: financial markets. We find that regardless of used data sets, all

34: data exhibits the volatility clustering properties, whereas those

35: which filtered the volatility clustering effect by using the GARCH

36: model reduce volatility clustering significantly. The result

37: confirms that our method can measure the volatility clustering

38: effect in financial market.

39: \end{abstract}

40:

41: \pacs{87.10.+e, 89.20.-a, 87.90.+y}

42: \keywords {econophysics, volatility clustering, GARCH}

43: \maketitle

44:

45: \section{Introduction}

46: Recently, financial markets have been known as representatives of

47: complex system, which changes the property of system dynamically

48: according to inflows of various information from outside and

49: interactions between heterogenous agents [1]. In order to

50: understand the complexity of financial market, the methods of

51: interdisciplinary research have been achieving in physics and

52: economics fields. The various stylized facts such as  the

53: long-term memory of volatility [2], volatility clustering [3], fat

54: tails [4], multifractality [5, 6] are observed. The obvious

55: properties among the stylized facts are the long-term memory

56: property and clustering effects of the volatility data. In

57: previous studies, clustering behaviors are shown in the return

58: time interval statistics of the climate records [7], medical data,

59: extreme floods [8], and economics [9].

60:

61: The various models which reflects the volatility clustering effect

62: in order to predict exactly the volatility in the econometrics

63: field are introduced. The autoregressive conditional

64: heteroskedasticity (ARCH) [10] and the generalized autoregressive

65: conditional heteroskedasticity (GARCH) model [11] are the

66: representatives. Namely, the many researches to understand the

67: micro-mechanism of market has been processed. However, the study

68: to quantify the volatility clustering effects is not sufficient

69: yet. If we observe quantitatively the volatility clustering effect

70: in financial markets, we will understand micro phenomena of the

71: market. In this paper, we propose the novel method to quantify the

72: volatility clustering effect in the financial time series.

73:

74: We find that all data sets analyzed in this paper exhibit the volatility clustering property, whereas the data which filters the volatility clustering effect by the GARCH(1,1) model reduces the degree of volatility clustering significantly.

75:

76: In the next section, we describe the data sets and methods used in this paper. In section \ref{sec:RESULTS}, we preset our results of this study. Section \ref{sec:CONCLUSIONS} concludes.

77:

78: \section{DATA and METHODS}

79:

80: \subsection{DATA}

81:

82: We investigate quantitatively the volatility clustering behaviors using financial time series including the following market data sets: the 5 minute S\&P 500 index from 1995 to 2004 and the 5 minute 28 individual stocks traded in the NYSE with the largest liquidity from 1993 to 2002. The return time series $r(t)$ is calculated by the

83: log-difference of high-frequency prices as follows: $r(t) = \ln P(t) -

84: \ln P(t-1)$ where $P(t)$ represents the stock price at time $t$.

85:

86: \subsection{Method to quantify the volatility clustering}

87:

88: In this subsection, we propose a novel method to quantify the

89: volatility clustering effect. We estimate and analyze quantitatively the volatility clustering

90: effect existed in the financial time series. The process is explained by the following.

91:

92: \textbf{Step 1} (The symbolized process): We transfer the return

93: time series $r(t)$ to the symbolic data $s(t)$ in order to

94: quantify the volatility clustering effect in the financial data

95: using the control parameter, such as the number of bins which is

96: defined as

97:

98: \begin{equation}

99: \{ S(t)=T_{i},  ~~~~~if  ~~ r(t) \in T_{i} \}, ~~ T_{i} = \{T_{1}, T_{2}, \cdots, T_{N_{b}}\}

100: \end{equation}

101: where $N_b$ is the number of bins. The conditional distribution with statistical significance is calculated by the symbolized process.

102:

103: \textbf{Step 2} (Calculating the conditional distribution): We estimate the conditional distribution using the symbolized time series generated in step 1. In other words, the conditional distribution corresponds to the next value of a specific symbol $S_T$ in the symbolic data. Next, we calculate repeatedly the conditional distribution $P(S_j | S_T)$ for all symbolic data in the proper regime. The conditional distribution of each symbolic data has a non-trivial property like the conditional value $S_T$ if there is a volatility clustering behavior.

104:

105: \textbf{Step 3} (The average value of conditional distribution): The step 3 is the calculation of the average value of the conditional distribution estimated in the step 2. We only consider the conditional distribution of symbolic data in the proper range because the extreme symbolic data are rare. By the average value of the conditional distribution, we observe the volatility clustering effect defined as

106:

107: \begin{equation}

108: \overline {S_{T}} = \frac{1}{N_T} \sum_{j=1}^{N_T}|P(S_j | S_T)|

109: \end{equation}

110: where $N_T$ is the element numbers of the conditional distribution

111: $P(S_j | S_T)$ in terms of a specific symbol $S_T$. If the average

112: value is not dependent on the symbolic value $S_T$, there is no

113: volatility clustering effect because the time series shows the

114: volatility clustering effect only when it has a positive

115: (negative) relation with the positive (negative) values of

116: $S_{T}$. Next, we calculate the relation between the specific

117: symbolic values $S_{T}$ and average values $\overline{S_{T}}$ of

118: conditional distribution. In other words, we observe the degree of

119: volatility clustering (DVC) behavior according to the relationship

120: between $\overline{S_{T}}$ and $S_{T}$. The average value

121: $\overline{S_{T}}$ is definded as

122:

123: \begin{displaymath}

124: \overline{S_T} = \left\{

125: \begin{array}{ll}

126: DVC^{P} \times S_T &S_t \geq 0\\

127: DVC^{N} \times S_T &\textrm{otherwise}\\

128: \end{array}

129: \right.

130: \end{displaymath}

131: where $DVC^{P,N}$ is the degree of volatility clustering effect

132: for the positive and negative cases respectively. When $DVC^{P,N}

133: = 0$, there is no clustering effect. However, when the value of

134: $DVC^{P,N}$ is nonzero, the degree of volatility clustering effect

135: according to the relative magnitude of $DVC^{P,N}$ is measured.

136: Therefore, we can estimate quantitatively the volatility

137: clustering effect of the financial time series.

138:

139: \section{Results}

140: \label{sec:RESULTS}

141:

142: In this section, we present the volatility clustering effect of the financial time series. In order to verify usefulness of the method proposed in this paper, we employ the GARCH(1,1) which reflects the volatility clustering effect.

143:

144: First of all, we apply the novel method to the 5 minute S\&P500 index and calculate the degree of volatility clustering. Fig. 1a represents the return time series of the 5 minute S\&P500 index and Fig. 1b shows its symbolic time series. We then calculate the conditional distribution $P(S|S_T)$ of a specific symbol data $S_T$. Fig. 1c shows the conditional distributions of specific symbols. In Fig. 1c, we find that the width of the conditional distribution increases as the value of symbolic data increases. In other words, the width of the conditional distribution of small symbolic data is relatively narrow than that of large symbolic data. The average value for the conditional distribution regarding specific symbolic data $S_T$ is calculated in order to observe the relationship between  specific symbolic values and its conditional distribution. Fig. 1d shows the relationship between specific symbol and average value. Circles and squares of Fig. 1d indicate the original and the surrogate time series respectively. We find that the average values of the conditional distribution for the original time series are positively related to the magnitude of specific symbolic value, $DVC_{S\&P500}^{P}=0.57$ and $DVC_{S\&P500}^{N}=0.52$, while those for the shuffled time series is not dependent on the symbolic value $S_{T}$. The return time series of the S\&P500 index shows the volatility clustering effect, the larger (small) values follow the larger (small) values.

145:

146: Next, we utilize the GARCH model to verify the usefulness of our

147: method. The GARCH model generates the volatility clustering

148: effect. We create the new time series with the volatility

149: clustering effect removed by the GARCH(1,1) filtering model and

150: estimate the degree of the volatility clustering effect. Fig. 2

151: displays the degree of the volatility clustering for the 28

152: individual stocks traded in the NYSE stock market with the largest

153: liquidity. The circles (red), the diamonds (blue), the squares

154: (green), and the triangles (pink) indicate the degree of the

155: volatility clustering for the positive and negative return time

156: series using the original and the GARCH(1,1) filtering data,

157: respectively. In Fig. 2, we find that all the stock return time

158: series, regardless of individual stocks, have the volatility

159: clustering effect, $0.38 \leq DVC^P \leq 0.72$ and $-0.69 \leq

160: DVC^N \leq -0.32$. However, after eliminating the volatility

161: clustering behavior by the GARCH(1,1) model, the degree of the

162: volatility clustering effect is reduced significantly. This

163: supports that our method to quantify the volatility clustering

164: effect in financial time series is working well.

165:

166: \section{Conclusions}

167: \label{sec:CONCLUSIONS}

168:

169: We proposed the novel method to quantify the volatility clustering

170: behavior in financial time series and calculated the degree of the

171: volatility clustering (DVC) using the diverse stock prices. First,

172: we found that all financial data analyzed exhibited the volatility

173: clustering properties, whereas those which are filtered the

174: volatility clustering effect by the GARCH(1,1) model reduced the

175: degree of the volatility clustering effect significantly. This

176: result confirmed that our method calculated the volatility

177: clustering effect in financial time series well. Our method might

178: be applied to elaborate clustering analysis of diverse complex

179: signals including climate, HRV as well as financial time series.

180: Further studies on the volatility clustering will examine to the

181: above systems more extensively.

182:

183:

184: This work was supported by the Korea Research Foundation funded by

185: the Korean Government (MOEHRD) (KRF-2005-042-B00075), and the

186: MOST/KOSEF to the National Core Research Center for Systems

187: Bio-Dynamics (R15-2004-033), and by the Ministry of Science \&

188: Technology through the National Research Laboratory Project, and

189: by the Ministry of Education through the program BK 21.

190:

191:

192:

193: \begin{thebibliography}{00}

194:

195: \bibitem{1}

196: R. N. Mantegna and H. E. Stanley, An Introduction to Econophysics:

197: Correlation and Complexity in Finance (Cambridge University Press,

198: Cambridge, U.K., 1999); J-P. Bouchaud, M. Potters, Theory of

199: Financial Risk and Derivative Pricing: From Statistical Physics to

200: Risk Management (Cambridge University Press, Cambridge, USA,

201: 2004);

202:

203: \bibitem{2}

204: G. Oh \textit{et al.}, J. Korean Phys. Soc. \textbf{48}, 197 (2006);

205: Y. Liu \textit{et al.}, Phys. Rev. E \textbf{60}, 1390 (1999);

206: T. Di Matteo, Journal of Banking \& Finance \textbf{29}, 827 (2005);

207: W. Lo Andrew, Econometrica \textbf{59}, 1279 (1991).

208:

209: \bibitem{3}

210: I. Giardina \textit{et al.}, Physica A \textbf{324}, 6 (2003);

211: B. Jacobsen, Journal of Empirical Finance \textbf{10}, 479 (2003).

212:

213: \bibitem{4}

214: R. N. Mantegna \textit{et al.}, Nature (London) \textbf{376}, 46 (1995);

215: R. N. Mantegna \textit{et al.}, Nature (London) \textbf{383}, 587 (1996);

216: V. Plerou \textit{et al.}, Nature (London) \textbf{421}, 130 (2003);

217: X. Gabaix \textit{et al.}, Nature (London) \textbf{423}, 267 (2003).

218:

219: \bibitem{Sauer}

220: J. F. Muzy \textit{et al.}, Eur. Phys. J. B \textbf{17}, 537 (2000);

221: Z. Eisler \textit{et al.}, Physica A \textbf{434}, 603 (2004);

222: L. Calvet \textit{et al.}, J. of Econometrics \textbf{105}, 27 (2001);

223: P.C. Ivanov \textit{et al.}, Nature (London) \textbf{399}, 461 (1999);

224: R. B. Gobindan and H. Kantz, Europhys, Lett. \textbf{68}, 184 (2004);

225: Y. Ashkenazy \textit{et al.}, Phys. Rev. Lett. \textbf{86}, 1900 (2001).

226:

227: \bibitem{Jacobsen}

228: J. F. Muzy \textit{et al.}, Int. J. Bifurcation Chaos Appl. Sci. Eng. \textbf{4}, 245 (1994).

229:

230: \bibitem{Hiemstra}

231: A. Bunde \textit{et al.}, Phys. Rev. Lett. \textbf{94}, 048701 (2005).

232:

233: \bibitem{Willinger}

234: M. Mudelsee \textit{et al.}, Nature (London) \textbf{425}, 166 (2003).

235:

236: \bibitem{Grau}

237: K. Yamasaki \textit{et al.}, Proc. Natl. Acad. Sci. \textbf{102}, 9424 (2005);

238: F. Wang \textit{et al.}, Phys. Rev. E \textbf{73}, 066128 (2006).

239:

240: \bibitem{Gabjin1}

241: R. F. Engle, Econometrica \textbf{50}, 987 (1982).

242:

243: \bibitem{Cajueiro}

244: T. Bollerslev, J. Econometrics \textbf{31}, 307 (1986).

245:

246:

247:

248: \newpage

249:

250: \begin{figure}

251:

252: \includegraphics[width=1.0\textwidth]{OHFig1.eps}

253:

254: \caption[0]{(a) The return time series of the 5 minute S\&P500

255: index for 9 years from 1995 to 2004. (b) Its symbolics time

256: series. (c) The conditional distributions of specific symbolic

257: values. Each emblem indicate the specific symbolic value. (d) The

258: degree of the volatility clustering of the S\&P500 index. The

259: circles and the squares indicate the shuffled and the original

260: time series respectively. }

261: \end{figure}

262:

263:

264: \begin{figure}

265:

266: \includegraphics[width=1.0\textwidth]{OHFig1.eps}

267: \caption[0]{The degree of the volatility clustering for the 28

268: individual stocks traded in the NYSE from 1993 to 2002 and its

269: GARCH filtering data. The circles (red), the diamonds (blue), the

270: squares (green), and the triangles (pink) indicate the positive

271: and the negative data of original and GARCH(1,1) filtering data

272: respectively.}

273: \end{figure}

274:

275:

276: %\begin{figure}[tb]

277:

278: %\includegraphics[height=10cm, width=16cm]{F3.eps}

279:

280: %\caption[0]{The degree of volatility clustering of the 28

281: %individual stocks and traded on the NYSE from 1993 to 2002 and its

282: %surrogate data eliminated the non-linearity by phase

283: %randomization. The circles (red), diamonds (blue), Squares

284: %(green), and triangles (pink) indicates the positive and negative

285: %data of original and surrogate data, respectively.}

286: %\end{figure}

287:

288:

289:

290:

291: \end{thebibliography}

292:

293: \end{document}

294: