0605:physics0605205/lee.tex

1: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

2: %\documentclass[twocolumn,showpacs,preprintnumbers,amsmath,amssymb]{revtex4}

3: \documentclass[preprint,showpacs,preprintnumbers,amsmath,amssymb]{revtex4}

4: \usepackage{graphicx}% Include figure files

5: \usepackage{dcolumn}% Align table columns on decimal point

6: \usepackage{bm}% bold math

7: \begin{document}

8:

9: \title{Statistical Self-Similar Properties of Complex Networks}

10: \author{Chang-Yong Lee}

11: \email{clee@kongju.ac.kr}

12: \affiliation{The Department of Industrial Information, Kongju National

13:   University, Chungnam, 340-702 South Korea}

14: \author{Sunghwan Jung}

15: \affiliation{Applied Mathematics Laboratory, Courant Institute for

16:   Mathematical Science, New York University, New York, NY 10012 USA}

17: %\date{\today}

18: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

19: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

20: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

21: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

22: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

23: \begin{abstract}

24: It has been shown that many complex networks shared distinctive

25: features, which differ in many ways from the random and the regular

26: networks. Although these features capture important characteristics of

27: complex networks, their applicability depends on the type of

28: networks. To unravel ubiquitous characteristics that complex networks

29: may have in common, we adopt the clustering coefficient as the

30: probability measure, and present a systematic analysis of various

31: types of complex networks from the perspective of statistical

32: self-similarity. We find that the probability distribution of the

33: clustering coefficient is best characterized by the multifractal;

34: moreover, the support of the measure had a fractal dimension. These

35: two features enable us to describe complex networks in a unified way;

36: at the same time, offer unforeseen possibilities to comprehend complex

37: networks.

38: \end{abstract}

39: \pacs{89.70.+c, 05.45.Df, 87.23.Ge}

40: \maketitle

41:

42: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

43: \section{Introduction}

44: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

45: The structure of complex systems across various disciplines can be

46: abstracted and conceptualized as networks (or graphs) of nodes and links to

47: which many quantitative methods can be applied so as to extract any

48: characteristics embedded in the system~\cite{newmansiam}.

49: Numerous complex systems have been expressed in

50: terms of networks, and have often been categorized by the research

51: field, such as social~\cite{amaral,newmansocial},

52: technological~\cite{fal,albertweb}, and biological

53: networks~\cite{guelzin,jeong}, to name a few.

54:

55: It was shown that many complex networks had distinctive

56: global features in common, including the small-world~\cite{watts} and

57: the scale-free~\cite{scale} properties. These uncovered

58: characteristics distinguish complex networks from the random and the regular

59: networks in that the average path between any two nodes is shorter

60: while maintaining highly clustered connections, and that the degree of

61: nodes approximately follows a power law distribution.

62: In addition to the global characteristics, investigations on

63: complex networks at the local level have been directed to

64: reveal local characteristics from the

65: perspective of finding patterns of interconnections among

66: nodes. Notable examples include the network motif~\cite{milo}, the

67: (dis)assortativity~\cite{assort,maslov}, and the topological modules

68: in the metabolic~\cite{ravasz} as well as the protein

69: interaction networks~\cite{maslov}.

70: Motivated by these characteristics, numerous models for the network

71: growth and evolution have been proposed to

72: understand the underlying mechanism of complex networks.

73:

74: To gain further understanding of complex networks, we investigate

75: local features of the networks from the perspective of the statistical

76: self-similarity. This may provide us with not only

77: deeper insight into complex networks, but a unified way of

78: describing them. To this end, we focus on the clustering

79: coefficient~\cite{watts} of a node $i$, defined as

80: \begin{equation}

81: C_{i}= \frac{n_{i}}{k_{i}~(k_{i}-1)}~ ,

82: \end{equation}

83: where $n_{i}$, $0 \le n_{i} \le k_{i}(k_{i}-1)$,

84: is the number of directed links (an undirected link counts twice)

85: among $k_{i}$ nearest neighbor nodes of the node $i$. It is a measure of the

86: interconnectivity among nearest neighbors of a node, or the

87: modularity~\cite{ravasz}, thus can be a quantity representing the local

88: structure of the network.

89:

90: The clustering coefficient has been analyzed from the perspective of

91: the degree correlation. It was found that the clustering coefficient

92: correlated with the degree in some complex networks. In particular,

93: there is a power law correlation between the clustering coefficient

94: and the degree for the deterministic scale-free network

95: model~\cite{doro}, the Internet~\cite{vazquez}, and metabolic

96: networks~\cite{ravasz}. It is a form of $\langle C \rangle(k) \sim

97: k^{- \delta}$, where $\langle \cdot \rangle$ represents the average

98: over the same degree, and $\delta$ depends on the type of

99: networks. However, a similar analysis of the clustering coefficient

100: reveals that the power law correlation is not manifest to other types

101: of networks, such as the protein interaction and social networks. As

102: shown in Fig.~1, the clustering coefficient of the film actor network

103: correlates with the degree not with a power-law, but exponentially (Fig.~1A);

104: while the neural network has an approximate linear correlation between the

105: clustering coefficient and the degree (Fig.~1B). Moreover, there is no

106: evident correlation in the protein networks (Fig.~1C, 1D).

107:

108: This finding suggests that in general the clustering coefficient is

109: not a simple quantity which can be related for some common ground to

110: other quantities, such as the degree. Thus, it is desirable to

111: analyze the clustering coefficient beyond the degree correlation.

112: In this paper, we focus on the systematic analysis of the clustering

113: coefficient in the wide classes of complex networks~\cite{data},

114: together with theoretical models for the complex networks, such as the

115: random~\cite{er}, the scale-free~\cite{scale}, and the small-world

116: networks~\cite{watts}.

117: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

118: \section{Data Description}

119: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

120: We analyzed the clustering coefficient in the complex networks of the

121: film actor network, WWW, the scientific collaboration network,

122: metabolic networks, protein interaction networks, the neural network,

123: and the Internet of both Autonomous System (AS) and the router

124: levels; together with models for the random, the scale-free, and the

125: small-world networks. For directed networks such as metabolic networks

126: and WWW, we distinguish the network into the directionality (in-degree and

127: out-degree) and carry out separate analysis. The source of the network

128: data is in Ref.~\cite{data}, and more information

129: of each network is the following. \\

130: {\bf Film actor:}

131:   An actor represents a node, and actors casted in the same movie are

132:   linked. (374511 nodes and 2445818 undirected links) \\

133: {\bf Scientific collaboration:}

134:   Each author corresponds to a node and co-authors are linked among

135:   others in the Los Alamos E-print Archive between 1995 and 1999

136:   (inclusive) in the field of the condensed matter. (13861 nodes and 89238

137:   undirected links) \\

138: {\bf Internet of Autonomous Systems (AS) level:}

139:   Each autonomous system represents a node, and a physical network connection

140:   between nodes represents a link. (6474 nodes and 25144 undirected links) \\

141: {\bf Internet of router level:}

142:   The Internet connection at the router level. The data is collected

143:   by the Mercator (\url{http://www.isi.edu/scan/mercator/}), a program

144:   that infers the router-level connectivity

145:   of the Internet. (284772 nodes and 898456 undirected links) \\

146: {\bf WWW:}

147:   World Wide Web connection network for \url{http://www.nd.edu}. Each HTML

148:   document represents a node, connected directionally by a hyperlink

149:   pointing from one document to another. It is a directed

150:   network of 325729 nodes and 1469679 directed links. \\

151: {\bf Metabolic networks:}

152:   Metabolic networks of six organisms, two for

153:   each domain, are analyzed. They are {\it Archaeoglobus fulgidus} (459

154:   nodes and 2155 directed links) and {\it Methanobacterium

155:   thermoautotrophicum} (399 nodes and 1937 directed links) in Archae;

156:   {\it Escherichia coli} (698 nodes and 3747 directed links) and {\it

157:   Salmonella typhi} (735 nodes and 3882 directed links) in Bacteria;

158:   {\it Saccharomyces cerevisiae} (511 nodes and 2690  directed links)

159:   and {\it Caenorhabditis elegans} (413 nodes and 2061 directed links)

160:   in Eukaryote. Note that the metabolic network is a directed

161:   network. \\

162: {\bf Protein interaction networks:}

163:   We have analyzed protein interaction networks of six organisms. They

164:   are {\it Saccharomyces cerevisiae} (4687 nodes and

165:   30312 undirected links), {\it Escherichia coli} (145 nodes and 388

166:   undirected links), {\it Caenorhabditis elegans} (2386 nodes and 7650

167:   undirected links), {\it Drosophila melanogaster}  (6926 nodes and

168:   41490 undirected links), {\it Helicobacter pylori} (686 nodes and

169:   2702 undirected links), and {\it Homo sapiens} (563 nodes and 1740

170:   undirected links)  \\

171: {\bf Neural network:}

172:   Somatic nervous system of {\it Nematode C. elegans} except that in

173:   the pharynz is considered. A link joins two nodes representing

174:   neurons, if they are connected by either a synapse or a gap

175:   junction. All links are treated as undirected. (265 nodes and 3664

176:   undirected links) \\

177: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

178: \section{Multifractality of complex networks}

179: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

180: The set of $C_{i}$ for

181: each network can be used to form a probability

182: distribution (Fig.~2). As shown in Fig.~2A-2D, the distribution

183: of $C_{i}$ in complex networks differ considerably

184: from that of the random

185: network (Fig.~2F). Probability distributions for complex networks

186: bring out high irregularity of various intensities in different

187: clustering coefficients, developing a long tail extending to either large

188: (Fig.~2A-2C) or small (Fig.~2D) values of the

189: clustering coefficient. This suggests that not a few but many, possibly

190: infinite, parameters may be needed to characterize the distribution.

191: To quantify the variation in the distribution, a continuous spectrum

192: of scaling indices has been proposed~\cite{halsey}.

193: For the spectrum that quantifies the inhomogeneity in the probability

194: distribution, we utilize the clustering coefficient as

195: the probability measure, and analyze the distribution from the

196: perspective of the statistical self-similarity, the

197: multifractal~\cite{stanley,paladin}.

198:

199: The multifractal, which is not necessarily related to geometrical

200: properties~\cite{benzi}, is a way to describe different self-similar

201: properties for different ``regions'' in an appropriate set (in our

202: case, different values of the clustering coefficient), and applied,

203: for instance, to the fully developed

204: turbulence~\cite{mandelbrot1,benzi}, which is one of the most common

205: examples of complex systems. It consists of spectra displaying

206: the range of scaling indices and their density in the set, thus has

207: been used to explain richer and more complex scaling behavior of a

208: system than the case in the critical phenomena.

209: The multifractal can be accomplished by examining the scaling

210: properties of the measure characterized by the singularity strength

211: $\alpha$ and its fractal dimension $f(\alpha)$, which

212: roughly indicates how often a particular value of $\alpha$

213: occurs~\cite{halsey}. In practice, $\alpha$ and $f(\alpha)$ are

214: customarily obtained from the Legendre transformation of $q$ and

215: $D_{q}$, via

216: \begin{equation}

217: \alpha=\frac{d}{dq} \{ (q-1) D_{q} \}~,

218: \end{equation}

219: and

220: \begin{equation}

221: f(\alpha)=q\alpha-(q-1)D_{q}~,

222: \end{equation}

223: where $D_{q}$ is the generalized correlation dimension often estimated

224: by the correlation integral method~\cite{hent}. It is the

225: quantity for anomalous scaling law whose value depends on different

226: moment $q$. $D_{q}$ is defined as

227: \begin{equation}

228: D_{q}=\lim_{R\rightarrow 0} \frac{1}{q-1} \frac{\ln S_{q}(R)}{\ln

229:   R}~,

230: \label{basic}

231: \end{equation}

232: where $S_{q}(R)$ is known as the correlation sum (or correlation

233: integral), and it is given, using Heaviside function $\Theta$, as

234: \begin{equation}

235: S_{q}(R)=\frac{1}{M}\sum_{j=1}^{M}\left\{ \frac{1}{M-1} \sum_{k=1, k\ne

236:       j}^{M} \Theta\left(R-\left|C_{j}-C_{k}\right|\right)

237:       \right\}^{q-1} ,

238: \end{equation}

239: where $M$ is the number of nodes and $C_{i}$ is the clustering

240: coefficient of node $i$.

241: The spectrum $D_{q}$ may be smoothed before transforming into $\alpha$

242: and $f(\alpha)$ to avoid the contradiction of the general

243: property of $D_{q}$, i.e., $D_{q} \le D_{q^{\prime}}~~\mbox{if}~~

244: q^{\prime} \le q$.

245:

246: There is a difficulty in taking the limit $R\rightarrow 0$ in

247: Eq.~(\ref{basic}) for a finite number of data points. Due to the

248: finiteness, there always exists the minimum distance of

249: $\left|C_{j}-C_{k}\right|$. Thus, when $R$ is less than the minimum

250: distance, the correlation sum becomes zero and no longer scales with

251: $R$. Therefore, in practice, the generalized dimension $D_{q}$ is determined by

252: plotting $\ln S_{q}(R)/(q-1)$ as a function of $\ln R$ and estimating

253: the slope within an appropriate scaling region using a least square

254: fit. The error associated with the fit can be obtained as a

255: statistical uncertainty based on fitting a straight line in the scaling region.

256:

257: Fig.~3 displays the estimated $f(\alpha)$ versus $\alpha$ for various complex

258: networks. As shown in Fig.~3A-3D, the infinitely many different

259: fractal dimensions, manifested by the shapes of $f(\alpha)$, suggest

260: that the measure is a multifractal, and thus, cannot be explained by a

261: single fractal dimension. All of the complex networks we have

262: examined, except for the neural network of {\it Caenorhabditis

263:   elegans}, form multifractals irrespective of their global

264: characteristics, such as the number of nodes and their degree

265: distributions. Furthermore, for all complex networks we have analyzed,

266: the average and standard deviation of the most probable singularity

267: strength $\alpha_{0}$, where $f(\alpha)$ takes its maximum value, are

268: $\langle \alpha_{0} \rangle = 1.2 \pm 0.3$; those of $f(\alpha_{0})$

269: are $\langle f(\alpha_{0}) \rangle =0.8 \pm 0.1$.

270:

271: The multifractal observed in complex networks implies that the

272: distribution of clustering coefficients can be described as

273: interwoven sets of singularities of strength $\alpha$, each of which

274: has its corresponding fractal dimension $f(\alpha)$~\cite{halsey}. In

275: our case, this implies that different values (or range of values)

276: of the clustering coefficient may have different fractal dimensions. From

277: the viewpoint of network dynamics in which rewiring and/or adding

278: new nodes and links are involved, the multifractal suggests that as a

279: network grows, nodes of large clustering coefficients change their

280: clustering coefficients by a factor that differs from nodes of small

281: clustering coefficients change theirs.

282:

283: The different rate of changing the clustering

284: coefficient may stem from two sources (or modes): the degree of a node

285: $k$ and the corresponding interconnectivity $n$. For a fixed $k$, the

286: clustering coefficient depends only on the interconnectivity $n$, so

287: that $C \sim n$. In this case, the dynamics (via rewiring and/or

288: adding new links) drive networks in such a way that different values

289: of $n$ are not equally probable; rather, some values of $n$ are more

290: probable than others. This assertion is further supported by the fact

291: that as $k$ increases, the number of distinct $n$ does not increase

292: quadratically in $k$, and that $n$ and $k$ are linearly correlated

293: (see below). For a fixed $n$, $C \sim k^{-2}$. Thus, the addition of

294: new links to higher degree nodes is more likely to drop their

295: clustering coefficient much faster than that of new links to lower degree

296: ones. Therefore, the dynamics of complex networks can be characterized

297: by an evolution via interplay between the two sources.

298:

299: Contrary to most complex networks, the irregularity of the

300: distribution is absent in the neural network of {\it Caenorhabditis

301:   elegans} (Fig.~2E). The estimate $D_{q}\approx 0.83 < 1$ is

302: independent of $q$, resulting in $\alpha=f(\alpha)\approx 0.83$. Thus,

303: the measure is not a multifractal, rather it can be characterized by a single

304: fractal dimension. The absence of the multifractality

305: is probably due to the biologically intrinsic property of the

306: neuron. The geometric character of the neuron imposes a

307: constraint on the number of synaptic contacts (i.e. links), leaving no

308: room for the irregularity of the distribution~\cite{amaral,neural}.

309:

310: Typically, $f(\alpha)$ satisfies $0 < f(\alpha) < D_{0}$, where $D_{0}$ is

311: the dimension of the support of the measure, which is the set of clustering

312: coefficients without their relative frequencies. We find

313: $D_{0} < 1$ for all complex networks (Fig.~3A-3D), indicating that

314: supports of the measure have fractal dimensions, just like the Cantor

315: set. This means that forbidden regions are embedded in the range of

316: allowed clustering coefficients so that some values are highly suppressed.

317: A possible cause of the suppression might stem from the correlation between

318: the degree $k$ of a node and its corresponding interconnectivity $n$.

319: A simple statistic, such as the

320: Pearson's correlation coefficient $r$, ranging $-1 < r < 1$, can be

321: used to quantify the correlation, as it reflects to what extent the two

322: quantities are linearly correlated. The result (Fig.~4A) reveals that

323: complex networks have higher linear correlation than the

324: random network. Moreover, some complex networks, such as metabolic

325: networks and the Internet of AS level, disclose strong linear correlations

326: ($r > 0.95$).

327:

328: For metabolic networks and the Internet of AS level in which the

329: degree of a node and its interconnectivity among its nearest neighbor

330: nodes are strongly correlated, we ask how the next nearest neighbor

331: nodes are interconnected, and whether the distribution of the

332: corresponding clustering coefficients maintains the

333: multifractality. To this end, we extend the definition of the

334: clustering coefficient to the next nearest nodes by including the next

335: nearest nodes for both the degree of a node $k$ and its

336: interconnectivity $n$, which is the number of links between two next

337: nearest nodes. The irregularity and the long-tail

338: characteristics are again found in the distribution of the extended

339: clustering coefficient, suggesting the existence of the

340: multifractal. As shown in Fig.~4B-4D, the probability distribution

341: of the extended version of the measure can again be characterized by

342: the multifractal, indicating that the statistical self-similarity is

343: not necessarily restricted to the local interconnectivity.

344: By including the next nearest neighbor nodes to the definition of the

345: clustering coefficient, more distinct values of the clustering

346: coefficient are possible than that of the nearest neighbor

347: nodes. This can be expected since the extended version of the

348: clustering coefficient of a node includes an average over the

349: clustering coefficients of its nearest neighbor nodes, partly

350: smoothing out the irregularity. This is also

351: manifested by the fact that $D_{0}^{\ast}$, the dimension of the

352: support of the extended clustering coefficient, is bigger than

353: corresponding $D_{0}$.

354:

355: Based on the multifractal found in complex networks, for

356: comparison, we carried out similar analysis to models of the random,

357: the scale-free, and the small-world networks.

358: In the case of the random network, the clustering coefficients are

359: smoothly distributed, by having a ``bell-shape'' (Fig.~2F). Furthermore,

360: $D_{q}\approx 1.0$ for all $q$, indicating that there is no

361: self-similarity. This can be expected since the support of the

362: measure can be regarded as a line, which is one dimensional.

363:

364: The distribution of the clustering coefficient for the scale-free network

365: shows the irregularity, similar to the case of complex networks;

366: furthermore the

367: distribution can be described by the multifractal (Fig.~3E). From

368: simulation results with various different parameters for the network,

369: however, we found that not only the most probable singularity

370: strength $\alpha_{0}$ ($0.64 < \alpha_{0} < 0.76$),

371: but the dimension of the support $D_{0}$ ($0.45 < D_{0} < 0.50$)

372: is smaller than that of complex networks, suggesting that more

373: severe restriction is imposed on the possible values of the clustering

374: coefficient. According to the model, nodes

375: of higher degree are preferred to have additional links rather than those

376: of lower degree. This preferential attachment leaves the clustering

377: coefficient of high degree nodes to decrease much faster than that of

378: low degree

379: ones. Thus when the number of nodes is doubled, for instance, the

380: clustering coefficient of high degree nodes changes by a factor

381: different from that of low degree nodes, analogous to the kinetics of

382: the diffusion-limited aggregation~\cite{meakin}.

383:

384: For the small-world network, the rewiring probability $p$ dictates

385: both the irregularity in the distribution and the multifractality. For a small

386: rewiring probability (say, $p=0.01$), the multifractal emerges

387: (Fig.~3F); however the dimension of the support is $D_{0}\approx 1.0$,

388: implying that the set of the measure entirely covers the space of the

389: clustering coefficient. As the rewiring probability increases, the range of

390: the clustering coefficients becomes smaller and the degree of inhomogeneity

391: decreases, and then the network finally becomes a random network as we can

392: easily anticipate.

393: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

394: \section{Summary and Conclusion}

395: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

396: Based on the irregularity of intensities in the probability

397: distribution of the clustering coefficient,

398: we regarded the clustering coefficient as the probability measure and

399: analyzed the clustering coefficient of various types of complex

400: networks from the perspective of the statistical self-similarity. We

401: found that the probability measure and the support of the measure can be

402: characterized by the multifractal and the fractal,

403: respectively. Furthermore, for complex networks having strong linear

404: correlation between the degree and the interconnectivity, the

405: multifractality extends into the clustering coefficient of the next

406: nearest neighbor nodes. These characteristics are unique to the real

407: complex networks and cannot be found in the

408: random network. From the aspect of the multifractality, models

409: of the scale-free and the small-world differ from real networks in the

410: distribution of the singularity strength $f(\alpha)$.

411:

412: The statistical self-similarity in the distribution of the clustering

413: coefficient can be served as a general characteristic of complex

414: networks; at the same time, giving a further insight into the

415: understanding of complex networks. The multifractality shared by

416: different complex networks suggests that a similar law may govern

417: the diverse complex networks of nature as well as artificiality.

418: Furthermore, it can be used to classify the complex networks, and

419: serves as a ``touchstone'' of proposed models for complex networks.

420: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

421: \begin{acknowledgments}

422: We like to thank M. Newman for providing us with the scientific

423: collaboration data. We also appreciate the open source of various

424: complex network data available at

425: \url{http://www.nd.edu/~networks/}. This work was supported by the

426: Korea Research Foundation Grant funded by the Korean Government (MOEHRD)

427: (KRF-2005-041-H00052).

428: \end{acknowledgments}

429: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

430: \begin{thebibliography}{99}

431: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

432: \bibitem{newmansiam} For a review of the network theory, see, for

433:   example, M. Newman, SIAM Review {\bf 45} 167 (2003), and

434:   R. Albert and A.-L. Barab{\' a}si, Rev. Mod. Phys. {\bf 74},

435:   47 (2002).

436: \bibitem{amaral} L. Amaral, A. Scala, M. Barthelemy, and H. E. Stanley,

437:   Proc. Natl. Acad. Sci. USA {\bf 97}, 11149 (2000).

438: \bibitem{newmansocial} M. Newman, Proc. Natl. Acad. Sci. USA {\bf 98},

439:   404 (2001).

440: \bibitem{fal} M. Faloutsos, P. Faloutsos, and C. Faloutsos,

441:   Comput. Commun. Rev. {\bf 29}, 251 (1999).

442: \bibitem{albertweb} R. Albert, H. Jeong, and A.-L. Barab{\' a}si, Nature

443:   (London) {\bf 401}, 130 (1999).

444: \bibitem{guelzin} N. Guelzim, S. Bottani, P. Bourgine, and F. Kepes,

445:   Nature Genetics {\bf 31}, 60 (2002).

446: \bibitem{jeong} H. Jeong, B. Tombor, R. Albert, Z. Litvai, and

447:   A.-L. Barab{\' a}si, Nature (London) {\bf 407}, 651 (2000).

448: \bibitem{watts} D. Watts and S. Strogatz, Nature (London) {\bf 393},

449:   440 (1998).

450: \bibitem{scale} A.-L. Barab{\' a}si and R. Albert, Science {\bf 286},

451:   509 (1999).

452: \bibitem{milo} R. Milo {\it et al}, Science {\bf 298}, 824 (2002).

453: \bibitem{assort} M.E.J. Newman, Phys. Rev. Letts. {\bf 89}, 208701 (2002).

454: \bibitem{maslov} S. Maslov and K. Sneppen, Science {\bf 296}, 910 (2002).

455: \bibitem{ravasz} E. Ravasz, A. L. Somera, D. A. Mongru, Z. N. Oltvai,

456:   and A.-L. Barab{\' a}si, Science {\bf 297}, 1551 (2002); E. Ravasz

457:   and A.-L. Barab{\' a}si, Phys. Rev. E {\bf 67}, 026112 (2003).

458: \bibitem{doro} S. N. Dorogovtsev, A. V. Goltsev, and J. F. F. Mendes,

459:   Phys. Rev. E {\bf 65}, 066122 (2002).

460: \bibitem{vazquez} A. V{\' a}zquez, R. Pastor-Satorras, and

461:   A. Vespignani, Phys. Rev. E {\bf 65}, 066130 (2002).

462: \bibitem{data} The sources for various types of network data are the

463:   following. Film actors, WWW, and metabolic networks were obtained from

464:   \url{http://www.nd.edu/~networks/}; the scientific

465:   collaboration data was provided by M. Neuman; the Internet of

466:   Autonomous Systems level was obtained from

467:   \url{http://moat.nlanr.net/Routing/rawdata/}; the Internet of

468:   router level is collected by the Mercator and is available at

469:   \url{http://www.isi.edu/scan/mercator/}; protein interaction

470:   networks data are available at \url{http://dip.doe-mbi.ucla.edu};

471:   the somatic nervous system of {\it Nematode C. elegans} was obtained from

472:   \url{http://ims.dse.ibaraki.ac.jp/research/}.

473: \bibitem{er} P. Erd{\"o}s and A. R{\' e}nyi,

474:   Pulb. Math. Inst. Hung. Acad. Sci. {\bf 5}, 17 (1960); B. Bollob{\'

475:   a}s, Random Graphs (Academic Press, London, 1985).

476: \bibitem{halsey} T.C. Halsey, M.H. Jensen, L.P. Kadanoff, I. Procaccia, and

477:   B.I. Shraiman, Phys. Rev. A {\bf 33}, 1141 (1986).

478: \bibitem{stanley} H.E. Stanley and P. Meakin, Nature (London) {\bf

479:     335}, 405 (1988).

480: \bibitem{paladin} G. Paladin and A. Vulpiani, Physics Reports {\bf

481:     156}, 147 (1987).

482: \bibitem{benzi} R. Benzi, G. Paladin, G. Parisi, and A. Vulpiani,

483:   J. Phys. A: Math. Gen. {\bf 19}, 823 (1986).

484: \bibitem{mandelbrot1} B.B. Mandelbrot,  J. Fluid Mech. {\bf 62},

485:   331 (1974).

486: \bibitem{hent} P. Grassberger and I. Procaccia, Phys. Rev. Lett. {\bf

487:   50}, 346 (1983); H. Hentschel and I. Procaccia, Physica (Amsterdam)

488:   {\bf 8D}, 435 (1983); K. Pawelzik and H. G. Schuster, Phys. Rev. A

489:   {\bf 35}, R481 (1987).

490: \bibitem{neural} J. White, E. Southgate, J. Thomson, and

491:   S. Brenner, Phil. Trans. R. Soc. London B {\bf 314}, 1 (1986).

492: \bibitem{meakin} P. Meakin, A. Coniglio, H.E. Stanley, and T.A. Witten,

493:   Phys Rev. A {\bf 34}, 3325 (1986).

494: \end{thebibliography}

495: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

496: \newpage

497: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

498: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

499: \begin{figure}

500: \centerline{

501: \includegraphics{fig1}

502: }

503: \caption{Plots of the clustering coefficient averaged over all nodes

504:   of the same degree versus the degree for selected complex

505:   networks. (A) the film actor network, (B) the neural network of {\it

506:   Caenorhabditis elegans}, (C) the protein interaction network of {\it

507:   Helicobacter pylori}, (D) the protein interaction network of {\it

508:   Homo sapiens}. Note that the abscissa of (A), and the abscissa as

509:   well as the ordinate of (C) and (D) are in log scale.}

510: \end{figure}

511: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

512: \begin{figure}

513: \centerline{

514: \includegraphics{fig2}

515: }

516: \caption{Probability distributions of the clustering coefficient for

517:   selected complex networks as examples. (A) the film actor network, (B)

518:   the protein interaction network for {\it Caenorhabditis elegans}, (C) the

519:   metabolic network (in-degree) of {\it Escherichia coli}, (D) the

520:   scientific collaboration network, (E) the neural network, and (F)

521:   the random network of 2000 nodes with the connection probability

522:   $p=0.1$. Note that ordinates of (A)-(D) are in log scale for

523:   the display purpose.}

524: \end{figure}

525: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

526: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

527: \begin{figure}

528: \centerline{

529: \includegraphics{fig3}

530: }

531: \caption{$f(\alpha)$ versus $\alpha$ for selected complex networks as

532:   examples. Error-bars are the root mean square in estimating

533:   $f(\alpha)$, and $D_{0}$ (the maximum of $f(\alpha)$) is the

534:   dimension of the support of the measure. (A) WWW (in-degree), (B)

535:   the scientific collaboration network (cond-mat), (C) the

536:   metabolic network (out-degree) of {\it Escherichia coli}, (D) the

537:   protein interaction network of {\it Saccharomyces

538:   cerevisiae}, (E) the model of the scale-free network with 10000 nodes,

539:   (F) the small-world model of the

540:   rewiring probability $p=0.01$.}

541: \end{figure}

542: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

543: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

544: \begin{figure}

545: \centerline{

546: \includegraphics{fig4}

547: }

548: \caption{(A): The linear correlation coefficients $r$ of complex networks

549:   plotted against the number of nodes are compared with the

550:   random networks of different numbers of nodes: the

551:   metabolic networks ($\Box$), the Internet of AS level ($\triangle$),

552:   the scientific collaboration network ($\triangledown$), protein

553:   interaction networks ($\bigcirc$), the actor network

554:   ($\blacktriangle$), WWW ($\blacktriangledown$), the Internet of

555:   router level ($\bullet$), and the random network

556:   ($\blacksquare$). For random networks,

557:   each node has on average 10 links. (B)-(D): $f(\alpha)$ versus

558:   $\alpha$ for selected organisms in the metabolic networks and the Internet of

559:   the AS level. $D^{\ast}_{0}$ represents the dimension of the support

560:   for the extended clustering coefficient. (B) the

561:   metabolic network of {\it Archaeoglobus fulgidus} (in-degree), (C) the

562:   metabolic network of {\it Caenorhabditis elegans} (in-degree), (D)

563:   the Internet of AS level. Note that $D^{\ast}_{0}$ for (B)-(D) are larger

564:   than corresponding $D_{0}$ for the case of the nearest neighbor,

565:   where, from 3B to 3D, $D_{0}=0.79, 0.73, 0.87$, respectively.}

566: \end{figure}

567: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

568: \end{document}

569: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

570: