0512:nlin0512014/d0.tex

1: \documentclass[12pt,a4paper]{article}

2:

3: \usepackage{ajr}

4:

5: \usepackage{defns}

6:

7: \begin{document}

8:

9: \title{Use the information dimension, not the Hausdorff}

10: \author{A.~J. Roberts\thanks{Dept.~Mathematics \& Computing, University

11: of Southern Queensland, Toowoomba, Queensland 4350, \textsc{Australia}.

12: \protect\url{mailto:aroberts@usq.edu.au}}}

13:

14: \date{Original version 1997, revised \today}

15:

16: \maketitle

17:

18: \begin{abstract}

19: Multi-fractal patterns occur widely in nature.  In developing new

20: algorithms to determine multi-fractal spectra of experimental data I

21: am lead to the conclusion that generalised dimensions~$D_q$ of order

22: $q\leq0$, including the Hausdorff dimension, are effectively

23: \emph{irrelevant}.  The reason is that these dimensions are

24: extraordinarily sensitive to regions of low density in the

25: multi-fractal data.  Instead, one should concentrate attention on

26: generalised dimensions~$D_q$ for $q\geq 1$, and of these the

27: information dimension~$D_1$ seems the most robustly estimated from a

28: finite amount of data.

29: \end{abstract}

30:

31: \tableofcontents

32:

33:

34: \section{Introduction}

35:

36: The characterisation of spatial distributions in terms of fractal

37: concepts~\cite{Mandelbrot79, Feder88} is becoming increasingly

38: important.  In particular, many distributions in nature are found to

39: have the characteristics of a multi-fractal~\cite{Hentschel83,

40: Halsey86, Paladin87}: among many examples are galaxy

41: clustering~\cite{Borgani93, Martinez91}, strange

42: attractors~\cite{Procaccia88a}, fluid turbulence~\cite{Sreenivasan91},

43: percolation~\cite{Isichenko92}, the shapes of

44: neurons~\cite{Jelinek01, Jelinek04},

45: and plant distributions~\cite{Emmerson95} and shapes~\cite{Jones96}.

46:

47: In application, methods for estimating fractal dimensions are often

48: unreliable.  One source of error lies in largely unknown biases

49: introduced by the finite size of data sets, addressed by

50: Grassberger~\cite{Grassberger88b}, and in the associated finite range

51: of length-scales inherent in gathered data.  In situations where

52: thousands or tens of thousands of data points are known such biases may

53: be minor; however, in some interesting problems, for example in the

54: spatial clustering of underwater plants~\cite{Emmerson95}, only of the

55: order of 100 data points are known and confidence in the fractal

56: characterisation may be misplaced.  We need to know more about factors

57: that cause errors in dimension estimates.

58:

59: Section~\ref{ss2} discusses the sensitivity of the multiplicative

60: multi-fractal process to regions of very low probability (measure).

61: Since such regions only rarely contribute a data point, an experimental

62: sample cannot discern them but such regions do affect the generalised

63: dimensions.  Hence I argue that the determination from experimental

64: data of generalised dimensions,~$D_q$, for non-positive~$q$ is

65: meaningless; for $0<q<1$ computations are very sensitive to the sample;

66: and thus the most robust fractal dimension is the information

67: dimension~$D_1$.  The argument is supported in Section~\ref{ss3} by a

68: maximum likelihood method~\cite{Roberts95b} of estimating the

69: multi-fractal properties of a data set.  The method shows the enormous

70: sensitivity of~$D_q$ for negative~$q$.  In contrast the information

71: dimension is reliably estimated.

72:

73:

74:

75:

76:

77: \section{Poor conditioning of generalised dimensions of negative order}

78: \label{ss2}

79:

80: For example, consider the Hausdorff dimension,~$D_0$, of multifractals

81: generated by two different ternary multiplicative process.

82: \begin{itemize}

83: 	\item Consider first the process shown in Figure~\ref{ftern}(a)

84: 	where an interval is divided into three thirds and the ``mass'' of

85: 	the original interval is assigned as follows: a fraction $f_1>0$

86: 	to the left third; a fraction $f_2=1-f_1>0$ to the right third;

87: 	and none to the middle third.  Repeat this subdivision

88: 	recursively.  This generates a multiplicative multifractal whose

89: 	Hausdorff dimension of $D_0=\log_32=0.6309$ is precisely the same

90: 	as the Cantor set because there is no ``mass'' in the middle

91: 	thirds.

92:

93: 	\item Conversely, and perversely, consider the process shown in

94: 	Figure~\ref{ftern}(b) where for some small~$\epsilon$ the ``mass''

95: 	is assigned as follows: a fraction $f_1>0$ is assigned to the left third;

96: 	a fraction $f_2>0$ is assigned to the rightmost third; and a small

97: 	fraction $\epsilon>0$ is assigned to the middle third (such that

98: 	$f_1+f_2+\epsilon=1$).  Repeat recursively.  This generates a

99: 	multiplicative multi-fractal whose Hausdorff dimension is $D_0=1$

100: 	because there is ``mass'' everywhere along the whole interval!

101: 	Although the vast bulk of the ``mass'' can be covered by~$2^n$

102: 	intervals of length~$3^{-n}$, we definitely do need~$3^n$

103: 	intervals in order to ensure coverage of the thinly spread

104: 	``mass'' that fills most of the original interval.

105: \end{itemize}

106: The importance of this for the analysis of an experimental data set of

107: $N$~sampled points is that one cannot tell the difference from the data

108: between these two multi-fractal generating processes for an

109: $\epsilon=\ord{1/N}$.  Thus one cannot estimate the Hausdorff

110: dimension~$D_0$ with any accuracy since either answer, $0.6309$~or~$1$

111: could be correct.

112: \begin{figure}[tbp]

113: 	\centerline{{\tt    \setlength{\unitlength}{0.075em}

114: \begin{picture}(402,196)

115: \thinlines    \put(242,18){$\epsilon f_2$}

116:               \put(170,18){$\epsilon f_1$}

117:               \put(206,18){$\epsilon^2$}

118:               \put(310,18){$f_2\epsilon$}

119:               \put(87,18){$f_1\epsilon$}

120:               \put(206,46){$\epsilon$}

121:               \put(336,10){\line(-1,0){37}}

122:               \put(151,10){\line(1,0){111}}

123:               \put(77,10){\line(1,0){37}}

124:               \put(151,40){\line(1,0){111}}

125:               \put(40,10){\begin{picture}(333,62)

126: \thicklines       \put(311,8){$f_2^2$}

127:                   \put(234,8){$f_2f_1$}

128:                   \put(88,8){$f_1f_2$}

129:                   \put(12,8){$f_1^2$}

130:                   \put(278,38){$f_2$}

131:                   \put(56,38){$f_1$}

132:                   \put(333,0){\line(-1,0){37}}

133:                   \put(222,0){\line(1,0){37}}

134:                   \put(111,0){\line(-1,0){37}}

135:                   \put(0,0){\line(1,0){37}}

136:                   \put(222,30){\line(1,0){111}}

137:                   \put(0,30){\line(1,0){111}}

138:                   \put(0,60){\line(1,0){333}}

139:                   \end{picture}}

140:                   \put(40,120){\begin{picture}(333,62)

141: \thicklines       \put(311,8){$f_2^2$}

142:                   \put(234,8){$f_2f_1$}

143:                   \put(88,8){$f_1f_2$}

144:                   \put(12,8){$f_1^2$}

145:                   \put(278,38){$f_2$}

146:                   \put(56,38){$f_1$}

147:                   \put(333,0){\line(-1,0){37}}

148:                   \put(222,0){\line(1,0){37}}

149:                   \put(111,0){\line(-1,0){37}}

150:                   \put(0,0){\line(1,0){37}}

151:                   \put(222,30){\line(1,0){111}}

152:                   \put(0,30){\line(1,0){111}}

153:                   \put(0,60){\line(1,0){333}}

154:                   \end{picture}}

155: \thicklines   \put(10,67){(b)}

156:               \put(10,177){(a)}

157: \end{picture}}}

158: 	\caption{schematic diagram of the first few stages in the

159: 	multiplicative multi-fractal process to illustrate the sensitivity

160: 	of the Hausdorff dimension~$D_0$ with respect to low density

161: 	regions,~(b), as a perturbation of the same process with zero

162: 	density regions,~(a).}

163: 	\protect\label{ftern}

164: \end{figure}

165:

166: Similar reasoning applies to generalised dimensions with negative~$q$.

167: Elementary arguments give that the generalised

168: dimensions~\cite{Hentschel83} of the multi-fractal generated by the

169: second process above are

170: \begin{equation}

171: 	D_q=\left\{

172: 	\begin{array}{ll}

173: 		\frac{-1}{q-1}\log_3\left[f_1^q+f_2^q+\epsilon^q\right] & \mbox{if

174: 		}q\neq 1\,,  \\

175: 		-\left[f_1\log_3f_1+f_2\log_3f_2+\epsilon\log_3\epsilon\right] &

176: 		\mbox{if }q=1\,.

177: 	\end{array}\right.

178: \end{equation}

179: It is readily appreciated that for negative order~$q$ and

180: small~$\epsilon$, the term~$\epsilon^q$ inside the logarithm

181: dominates the evaluation of the generalised dimension~$D_q$.  Hence, all

182: generalised dimensions for negative~$q$ are also extremely sensitive to

183: small~$\epsilon$.  In a data set obtained from experiments, one cannot

184: expect to distinguish between zero~$\epsilon$ and small non-zero

185: $\epsilon=\ord{1/N}$, and yet the generalised exponents and

186: multi-fractal spectrum are markedly different.  See Figure~\ref{fthe}

187: which plots the generalised dimensions for $f_1\approx1/4$,

188: $f_2\approx3/4$ and various small~$\epsilon$.

189: \begin{figure}[tbp]

190: 	\centerline{\includegraphics[width=0.95\textwidth]{gendim}}

191: 	\caption{multi-fractal generalised dimensions~$D_q$ for the

192: 	ternary multi-fractal process with $f_1=(1-\epsilon)/4$,

193: 	$f_2=(1-\epsilon)3/4$ and $\epsilon=0$ (solid), $0.01$ (dashed)

194: 	and $0.05$ (dotted).  This figure shows that $D_q$~for negative

195: 	order~$q$ is extraordinarily sensitive to small influences: the

196: 	curve of smaller~$\epsilon$ is the most changed.}

197: 	\protect\label{fthe}

198: \end{figure}

199:

200: We can be more precise about the sensitivity to low density regions

201: by computing the derivative of~$D_q$ with respect to~$\epsilon$.  For

202: definiteness, suppose $f_1=\phi_1(1-\epsilon)$ and

203: $f_2=\phi_2(1-\epsilon)$.  Then

204: \begin{equation}

205: 	\frac{\partial D_q}{\partial \epsilon}=\frac{-q}{q-1}

206: 	\frac{\epsilon^{q-1}-\left(\phi_1^q+\phi_2^q\right)(1-\epsilon)^{q-1}}%

207: {\log3\,\left[\epsilon^q+\left(\phi_1^q+\phi_2^q\right)(1-\epsilon)^q\right]}

208: \,.

209: 	\label{ede}

210: \end{equation}

211: For small, but non-zero, $\epsilon\to 0$ this asymptotes to

212: \begin{equation}

213: 	\frac{\partial D_q}{\partial\epsilon}\sim\frac{1}{\log3}\left\{

214: 	\begin{array}{ll}

215: 		\frac{q}{q-1} & \mbox{if }1<q\,,  \\

216: 		\frac{q}{(1-q)(\phi_1^q+\phi_2^q)}\epsilon^{q-1} & \mbox{if

217: 		}0<q<1\,,  \\

218: 		\frac{q}{1-q}\epsilon^{-1} & \mbox{if }\phantom{0<}q<0\,.

219: 	\end{array}

220: 	\right.

221: 	\label{easy}

222: \end{equation}

223: This derivative is unbounded as $\epsilon\to0$ for $q<1$, and so any

224: computation of~$D_q$ is only robust if $q\geq 1$.

225:

226: The reason for this aberrant behaviour is clear.  With a finite number

227: of data points, it is impossible to tell the difference between truly

228: empty space and space which is visited so rarely that no data point

229: happens to fall within it.  That is, one cannot tell the difference

230: between empty space and space that should be filled in with very low

231: probability.  These differences dramatically affect the generalised

232: dimensions~$D_q$ for $q<1$.  Thus for any experimental data set:

233: \begin{itemize}

234: 	\item  estimating~$D_q$ for $q\leq0$ is nonsense (including the Hausdorff

235: 	dimension);

236:

237: 	\item  estimates of~$D_q$ for small positive~$q$ are sensitive; and

238:

239: 	\item I only recommend the reporting of dimensions~$D_q$

240: 	for $q\geq1$ as being robust.

241: \end{itemize}

242: Out of all the generalised dimensions for order $q\geq 1$, $D_1$~is

243: most representative of the fractal as a whole.  For large order~$q$,

244: the computation of~$D_q$ is determined only by the very ``densest''

245: regions of the multi-fractal and so is not representative of the whole

246: fractal.  In the above multiplicative process,

247: \begin{displaymath}

248: 	D_q\sim-\log_3\mbox{max}(f_1,f_2,\epsilon)

249: 	\quad\mbox{as}\quad q\to\infty\,,

250: \end{displaymath}

251: showing that the large~$q$ behaviour is dictated by the one parameter

252: of the process that determines the character of the very densest

253: clusters in the fractal.  The very dense clusters occur rarely in the

254: fractal; they have low fractal dimension as seen in the low~$f$ value

255: typically associated with low values of~$\alpha$ in the multi-fractal

256: spectrum.  Because of this rareness, the computation from experimental

257: data of~$D_q$ for large positive order~$q$ is unreliable.  Then,

258: conversely, the information dimension weights the data most uniformly,

259: and so ``knows'' most about the fractal, without being overly

260: sensitive to the possible occurrence of regions of very low

261: probability.  The information dimension seems most informative.

262:

263:

264:

265:

266: \section{Fractal dimensions unbiased by finite size of data sets}

267: \label{ss3}

268: Cronin \& Roberts~\cite{Roberts95b} proposed a novel method to

269: eliminate biases, caused by finite sized data sets, in determining the

270: multi-fractal properties of a given data set.  Jelenik et

271: al.~\cite{Jelinek01, Jelinek04} used this method to explore the shape

272: of neuron cells.  The method compares characteristics of the inter-point

273: distances in the data set with those of artificially generated

274: multi-fractals.  By maximising the likelihood that the characteristics

275: are the same we model the multi-fractal nature of the data by the

276: parameters of the artificial multi-fractal.  By searching among

277: artificial multi-fractals with precisely the same number of sample

278: points as in the data, we anticipate that biases due to the finite

279: sample size will be statistically the same in the data and in the

280: artificial multi-fractals; hence predictions based upon the fitted

281: multi-fractal parameters should be unbiased by the finite sample size.

282:

283: The method also appears to give a reliable indication of the error in

284: the estimates---a very desirable feature as also noted by Judd \&

285: Mees~\cite{Judd91}.  Most importantly for this paper, I generate finite

286: size data sets with specific parameters for the following specific

287: multiplicative multi-fractal process.  Given parameters

288: $\rho\in[0,0.5]$ and $\phi\in[0,0.5]$ a binary multiplicative

289: multi-fractal is generated by the recursive procedure of dividing each

290: interval into two halves, then assigning a fraction~$\phi$ of the

291: points in the interval to a random sub-interval of length~$\rho$ in the

292: left half, and the complementary fraction $\phi'=1-\phi$ to a random

293: sub-interval of length~$\rho$ in the right half.  Such a process has

294: generalised dimension

295: \begin{equation}

296: 	D_q=\frac{\log\left(\phi^q+{\phi'}^q\right)}{(q-1)\log\rho}\,,

297: 	\label{emdq}

298: \end{equation}

299: and a multi-fractal spectrum $f(\alpha)$~\cite[\S4]{Halsey86} given

300: parameterically in terms of $0<\xi<1$ and $\xi'=1-\xi$ as

301: \begin{equation}

302: 		f = \frac{\xi\log{\xi}+\xi'\log{\xi'}}

303: 		        {\log{\rho}}

304: 		\,, \qquad

305: 		\alpha = \frac{\xi\log{\phi}+\xi'\log{\phi'}}

306: 		        {\log{\rho}} \,.

307: 	\label{emfs}

308: \end{equation}

309: Here I chose $\rho=1/3$ and $\phi=1/4$ and sample the process with

310: $N=100$; such a multi-fractal forms a finite data set whose

311: parameters we need to estimate from the sample.

312:

313: As explained in~\cite{Roberts95b}, we analyse such a sample by probing

314: it with \emph{exactly} the same multiplicative multi-fractal process,

315: and seek the best fit parameters.  Here the resulting estimate of the

316: original parameters is then in error \emph{only} due to the finite size

317: of the sample of the original multi-fractal process.  Because we fit

318: the data with a process which we know includes the one that generated

319: the data (a luxury rare in practise), there is no other

320: error.  Thus the spread in errors that we see is characteristic of only

321: the errors induced by a finite sized sample, nothing else.  In

322: particular,  observe that the deductions of the preceding section are

323: indeed appropriate.

324:

325: \begin{figure}

326: \centerline{\includegraphics[width=0.95\textwidth]{n100means}}

327: \caption{predicted multi-fractal parameters $(\rho,\phi)$, indicated

328: by~$\circ$'s, from the maximum likelihood match to an ensemble of 16

329: different realisations, each of $N=100$ data points, of a binary

330: multiplicative multi-fractal with parameters $\rho=1/3$ and

331: $\phi=1/4$, indicated by~$+$.  The mean location of the

332: predictions is indicated by a~$\times$.}

333: \protect\label{n100means}

334: \end{figure}

335:

336: I repeat the sampling of the multi-fractal followed by a maximum

337: likelihood estimate of the parameters 16~times.  Figure~\ref{n100means}

338: plots the estimates of the parameters.  Observe that the whole sampling

339: and estimation process appears unbiased in that the mean of the

340: predictions is reasonably close to the correct values of the

341: parameters.

342:

343: \begin{figure}

344: \centerline{\includegraphics[width=0.95\textwidth]{n100dqs}}

345: \caption{ensemble of multi-fractal generalised dimensions~$D_q$,

346: dotted, for each of the predictions plotted in

347: Figure~\protect\ref{n100means} made from samples of $N=100$ data

348: points.  For comparison the generalised dimensions for the actual

349: fractal is plotted as the solid line.  Observe the good estimation

350: near the information dimension, but the large errors for negative

351: order~$q$.}

352: \protect\label{n100dqs}

353: \end{figure}

354:

355: Ultimately, experimenters want to examine multi-fractal properties of

356: the data.  Here these will be determined from the parameters

357: $(\rho,\phi)$ of the best fit multi-fractal substituted into analytic

358: expressions such as (\ref{emdq})~and~(\ref{emfs}).  For each of the

359: 16~realisations and their best-fit estimates, I plot the corresponding

360: predicted generalised dimensions~$D_q$ in Figure~\ref{n100dqs}.  (The

361: corresponding graphs of the multi-fractal spectra~$f(\alpha)$ are

362: plotted in Figure~7 of~\cite{Roberts95b} along with the true

363: $f(\alpha)$~curve.)  Observe that the predicted dimensions for

364: positive~$q$ (low~$\alpha$) are quite good for all realisations,

365: especially near the information dimension,~$D_1$.  However, predicted

366: dimensions for negative~$q$ (high~$\alpha$) are very poor; this is

367: also the case for the Hausdorff dimension~$D_0$ (the maximum of the

368: $f(\alpha)$~curve).  The negative~$q$ predictions are poor despite the

369: fitting process ``knowing'' that there are no very low probability

370: regions in this artificial process.  In general applications one

371: cannot know this and I expect the negative~$q$ (large~$\alpha$)

372: predictions to be significantly worse.  These numerical results

373: convincingly support the arguments of the preceding section that we

374: should use the information dimension, not the Hausdorff.

375:

376:

377:

378:

379:

380:

381: \bibliographystyle{plain}

382: \bibliography{ajr,bib}

383:

384:

385: \end{document}

386:

387:

388: