0609:cond-mat0609264/file.tex

1: \documentclass[twocolumn,showpacs,preprintnumbers,amsmath,amssymb]{revtex4}

2:

3:

4: \usepackage{graphicx}% Include figure files

5: \usepackage{dcolumn}% Align table columns on decimal point

6: \usepackage{bm}% bold math

7:

8: \begin{document}

9:

10:

11:

12: \title{Universal Scaling Behavior of Clustering Coefficient\\ Induced by Deactivation Mechanism}

13: \author{Liang Tian}

14: \author{Chen-Ping Zhu}

15:  \email{chenpingzhu@yahoo.com.cn}

16: \author{Da-Ning Shi}

17:  \email{shi@nuaa.edu.cn}

18: \author{Zhi-Ming Gu}

19: \affiliation{College of Science, Nanjing University of Aeronautics

20: and Astronautics, Nanjing, 210016, PR China}

21: \author{Tao Zhou}

22: \affiliation{Nonlinear Science Center and Department of Modern

23: Physics, University of Science and Technology of China, Hefei,

24: Anhui, 230026, PR China}

25:

26: \date{\today}

27:

28: \begin{abstract}

29: We propose a model of network growth that generalizes the

30: deactivation model previously suggested for complex networks.

31: Several topological features of this generalized model, such as the

32: degree distribution and clustering coefficient, have been

33: investigated analytically and by simulations. A scaling behavior of

34: clustering coefficient $C \sim 1/M$ is theoretically obtained, where

35: $M$ refers to the number of active nodes in the network. We discuss

36: the relationship between the recently observed numerical behavior of

37: clustering coefficient in the coauthor and paper citation networks

38: and our theoretical result. It shows that both of them are induced

39: by deactivation mechanism. By introducing a perturbation, the

40: generated network undergoes a transition from large- to small-world,

41: meanwhile the scaling behavior of $C$ is conserved. It indicates

42: that $C \sim 1/M$ is a universal scaling behavior induced by

43: deactivation mechanism.

44: \end{abstract}

45:

46: \pacs{89.75.Hc, 87.23.Ge, 89.65.-s, 89.75.Fb}

47:

48: \maketitle

49:

50:

51: \section{INTRODUCTION}

52: Many social, biological, and communication systems can be properly

53: described as complex networks with nodes representing individuals or

54: organizations and links mimicking the interactions among them[1-3].

55: Examples are numerous: these include the Internet[4,5], the World

56: Wide Web[6,7], biological networks[8,9], food webs[10], social

57: networks[11], etc. Recent empirical studies indicate that the

58: networks in various fields exhibit some common topological

59: characteristics: a small average distance as random networks, large

60: clustering coefficient as regular networks (small-world

61: property)[12] and a power-law degree distribution (scale-free

62: property)[13]. The ubiquity of complex networks has inspired

63: tremendous investigations on them. Among these flourishing

64: researches, the effect of aging is of particular interest[14-18],

65: since it is a universal mechanism in reality. For instance, in the

66: movie actor collaboration network, the more famous an actor is, the

67: more chances he will have to act in new movies. But, no matter how

68: famous he may be, every star will become gradually inactive as time

69: goes on. This aging effect can greatly influence the evolution of

70: networks and results in peculiar network structural property[14,15].

71:

72: Recently, B\"orner \emph{et al}. introduced a general process model

73: that simultaneously grows coauthor and paper citation networks[19],

74: in which the core assumption is that the twin networks of scientific

75: researchers and academic articles mutually support one another. In

76: their model, each of the authors and papers is assigned a topic, and

77: authors read, cite, produce papers or coauthor with others only in

78: their own topic area. Interestingly, they found that the clustering

79: coefficient $C$ of the simulated paper citation network is linearly

80: correlated with the number of topics. We note that the main

81: underlying dynamic rule governing the evolution of the network is

82: aging. For example, due to the lifespan of human, once authors are

83: older than a specified age, they will be set deactivated, and do not

84: produce papers or coauthor with others any longer. Furthermore,

85: papers cease to receive links when their contents are outdated.

86: Therefore, these considerations motivate us to theoretically

87: investigate the effect of aging on the clustering coefficient of the

88: network. In the present paper, we concentrate on this ingredient of

89: self-organization of the coauthor and paper citation networks and

90: propose a simple generalized model, in which the main dynamic is

91: deactivation mechanism. We will demonstrate that the behavior of

92: clustering coefficient $C$ in the coauthor and paper citation

93: networks is universal in networks generated by deactivation

94: mechanism.

95:

96: This paper is organized as follows. In section II, the model is

97: introduced. In section III, we give both the numerical and analytic

98: results about the effect of deactivation mechanism on network

99: structure, including degree distribution (Sec. III A) and clustering

100: coefficient (Sec. III B). An interesting scaling behavior of $C$ is

101: obtained. In section IV, a structural perturbation is introduced. We

102: show that the perturbation leads to a structural transition from

103: large- to small-world (Sec. IV A), while the scaling behavior of $C$

104: is conserved (Sec. IV B). Finally in section V, we discuss the

105: relationship between our result and the behavior of clustering

106: coefficient in the coauthor and paper citation networks and give a

107: summary .

108:

109: \section{THE MODEL}

110: In the present model, each node can be in two different states:

111: active or inactive[18,20,21]. The evolution process starts with a

112: one-dimensional lattice consisting of $M$ active nodes with periodic

113: boundary condition and coordination number $2z$[22]. Then, in each

114: time step

115:

116: \

117:

118: (1) Add a new node into the network, and connect it to $m$ nodes

119: randomly chosen from the $M$ active ones.

120:

121: (2) Activate the new node.

122:

123: (3) Deactivate one of the active nodes. The probability that the

124: node $i$ is deactivated is given by

125: \begin{eqnarray}\pi(k_i)=\frac{\alpha}{k_i},\end{eqnarray}

126: where the normalization factor is defined as

127: $\alpha=(\sum_{j\in\mathcal{A}}1/k_j)^{-1}$. The summation runs over

128: the set $\mathcal{A}$ of the currently $M+1$ active nodes.

129:

130: \

131:

132: It is worthwhile to note that, when $M=m$ and $z=[\frac{M}{2}]$, the

133: present model reduces to the famous deactivation model introduced by

134: Klemm and Egu\'iluz (KE model)[18]. For convenience, we call this

135: generalized deactivation model GKE model.

136:

137: \section{STRUCTURAL PROPERTIES}

138: \subsection{Degree distribution}

139: By using the continuous approximation similar to that used in Ref.

140: [18], the degree distribution $P(k)$ can be obtained analytically

141: for GKE model. Let us first derive the degree distribution

142: $p^{(t)}(k)$ of the active nodes at time $t$. It evolves according

143: to the following master equation:

144: \begin{eqnarray}p^{(t+1)}(k+1)&=&p^{(t)}(k)\frac{m}{M}[1-\pi(k)] \nonumber \\ &+&p^{(t)}(k+1)[1-\frac{m}{M}][1-\pi(k+1)].\end{eqnarray}

145: On the right side of Eq. (2), the first term accounts for the

146: process in which an active node with degree $k$ at time $t$ is

147: connected to the new node and not deactivated in the next time step;

148: The second term indicates the process that an active node with

149: degree $k+1$ at time $t$ is not connected to the new node and still

150: active in the next time step.

151:

152: \begin{figure}

153: \scalebox{0.9}[0.9]{\includegraphics{fig1}} \caption{Illustration of

154: the normalization factor $\alpha$ as a function of time $t$ with the

155: parameters $m=10$ and $M=30$. The amplified version can be seen in

156: the inset. The data points correspond to system size

157: $N=2\times10^4$, and each is obtained as an average of $100$

158: independent runs.}

159: \end{figure}

160:

161: We investigate the behavior of $\alpha$ in time evolution. Fig. 1

162: shows the dependence of the normalization factor $\alpha$ on time

163: $t$. We find that $\alpha$ approaches a stable value with certain

164: fluctuations as soon as the evolution of the network starts. We

165: assume that the fluctuations of the normalization factor $\alpha$

166: are small enough, i. e., it can be treated as a constant. Then, the

167: stationary case $p^{(t+1)}(k)=p^{(t)}(k)$ of Eq. (2) yields

168: \begin{eqnarray}p(k+1)-p(k)=\frac{-\alpha-(\gamma-1)k}{k(k+\gamma-\alpha)}p(k),\end{eqnarray}

169: where $\gamma=\frac{\alpha M}{m}+1$. Treating $k$ as continuous we

170: write down the equation

171: \begin{eqnarray}\frac{dp}{dk}=\frac{-\alpha-(\gamma-1)k}{k(k+\gamma-\alpha)}p(k),\end{eqnarray}

172: which yields the solution

173: \begin{eqnarray}p(k)\sim k^{-\gamma+1}.\end{eqnarray}

174: When the system size $N$ is large compared with $M$, the degree

175: distribution of the whole network $P(k)$ can be approximated by

176: considering the inactive nodes only. Thus $P(k)$ can be calculated

177: as the rate of the change of the degree distribution $p(k)$ of the

178: active nodes. We find

179: \begin{eqnarray}P(k)=-\frac{dp}{dk}=ck^{-\gamma},\end{eqnarray} where

180: $c=(\gamma-1)m^{\gamma-1}$ is the normalization constant. Finally,

181: the exponent $\gamma=3$ is obtained from a self-consistent condition

182: \begin{eqnarray}2m=\int_m^\infty k P(k)dk.\end{eqnarray}

183:

184: The exponent $\gamma$ can be tunable if we introduce the initial

185: attractiveness just like that of the model in Ref. [18]. Since it is

186: not our focus, we will not show this effect here.

187:

188: In Fig. 2, we plot the cumulative degree distribution of GKE

189: networks by simulations. We obtain a power law scaling with

190: best-fitted exponent $\gamma-1=1.96\pm0.02$, which is in agreement

191: with the analytical result. In fact, the exponent $\gamma$ is

192: dependent on $m$[20], which can be ignored when $m$ is large.

193: However, the number of active nodes $M$ has no effect on degree

194: distribution exponent $\gamma$, which is analytically and

195: numerically obtained.

196:

197: \begin{figure}

198: \scalebox{0.8}[0.8]{\includegraphics{fig2}} \caption{(Color online)

199: Cumulative degree distribution of GKE networks with parameters (a)

200: $M=20$; $m=4$ (squares), $8$(upward triangles), $14$ (downward

201: triangles), $20$ (circles) and (b) $m=20$; $M=20$ (squares),

202: $40$(upward triangles), $80$ (downward triangles), $100$ (circles).

203: The data points correspond to system size $N=2\times10^4$, and each

204: is obtained as an average of $100$ independent runs. The two dash

205: lines have slope $-2.0$ for comparison.}

206: \end{figure}

207:

208: \subsection{Clustering coefficient}

209: The clustering coefficient $C(l)$ of node $l$ with degree $k_l$ can

210: be defined as follow:

211: \begin{eqnarray}C(l)=\frac{2E(l)}{k_l(k_l-1)},\end{eqnarray} where $E(l)$ is the number

212: of links between neighbors of node $l$.

213:

214: According to the definition of the GKE model, when a new node with

215: $m$ links is added into the network, the links are attached to the

216: nodes randomly selected from the active ones. Thus, the probability

217: that two arbitrary active nodes are connected is $\frac{m}{M}$. It

218: follows that a node $l$ with degree $k_l=m$ has

219: \begin{eqnarray}E(l)=\frac{m}{M}\frac{k_l(k_l-1)}{2}.\end{eqnarray}

220: If $l$ is deactivated in the time step of its generation its

221: neighborhood does not change any more and $C(l)$ keeps stable.

222: Otherwise, node $l$ is not deactivated. In the next time step, a new

223: node $j$ is added. As we note, the probability that node $j$ makes

224: connection to $l$ is equal to the probability that one of the

225: neighbors of node $l$ is deactivated in the last time step. We

226: assume that if $k_l$ is added by $1$, one of its active neighbors

227: has already been deactivated in the last time step. Thus, when the

228: newly added node is connected to node $l$, one of its neighbors $s$

229: is inactive and one possible link between the newly added node and

230: $s$ is missed. Then we have

231: \begin{eqnarray}E(l)=\frac{m}{M}[\frac{k_l(k_l-1)}{2}-1],\end{eqnarray}

232: where $k_l=m+1$. Also, if $k_l=m+2$, there will be $2$ inactive

233: nodes in the neighbors of node $l$ causing another $2$ possible

234: links to be missed. Thus we obtain

235: \begin{eqnarray}E(l)=\frac{m}{M}[\frac{k_l(k_l-1)}{2}-1-2],\end{eqnarray}

236: where $k_l=m+2$. This process repeats until node $l$ is deactivated,

237: whose neighborhood does not change any more. By induction, we have

238: \begin{eqnarray}E(l)=\frac{m}{M}[\frac{k_l(k_l-1)}{2}-\sum_{\nu=1}^{k_l-m}\nu].\end{eqnarray}

239: Thus the clustering coefficient $C(l)$ depends only on the degree

240: $k_l$. The exact relation is

241: \begin{eqnarray}C(l)=\frac{m}{M}[1-\frac{(k_l-m+1)(k_l-m)}{k_l(k_l-1)}].\end{eqnarray}

242: The clustering coefficient $C$ of the whole network is the average

243: of $C(l)$ over all nodes, i. e.,

244: \begin{eqnarray}C=\frac{1}{N}\sum_{l=1}^N \frac{m}{M}[1-\frac{(k_l-m+1)(k_l-m)}{k_l(k_l-1)}].\end{eqnarray}

245: Writing Eq. (14) in continuous form yields

246: \begin{eqnarray}C=\int_m^\infty \frac{m}{M}[1-\frac{(k-m+1)(k-m)}{k(k-1)}]P(k)dk,\end{eqnarray}

247: where $P(k)$ is the degree distribution which we have derived above.

248: Finally, the result is

249: \begin{eqnarray}C=\frac{1}{M}[\frac{5m}{6}-\frac{7}{30}+\mathcal{O}(m^{-1})].\end{eqnarray}

250: Obviously, when $M=m$, the clustering coefficient of the KE model is

251: recovered[23].

252:

253: From Eq.(16), we know that the clustering coefficient $C$ is

254: independent of the system size $N$. This asymptotic behavior of $C$

255: is reported in Fig. 3. In the limit of large $N$, the clustering

256: coefficient $C$ gets to an stationary value of $0.31$, which agrees

257: with the analytical result.

258:

259: It is important to point out that the clustering coefficient has an

260: novel scaling behavior $C\sim1/M$. Extensive numerical simulations

261: perfectly confirm this result (see Fig. 4). This behavior can be

262: related to the recent numerical study on the coauthor and paper

263: citation networks, which will be discussed in Sec. V.

264:

265: \begin{figure}

266: \scalebox{0.9}[0.9]{\includegraphics{fig3}} \caption{Illustration of

267: the average clustering coefficient $C$ as a function of system size

268: $N$ with the parameter $m=4$ and $M=10$. The clustering coefficient

269: $C$ approaches a stationary value about 0.31, which is precisely

270: predicted by Eq. (16). Each data point is obtained as an average of

271: $1000$ independent runs.}

272: \end{figure}

273:

274: \begin{figure}

275: \scalebox{0.9}[0.9]{\includegraphics{fig4}} \caption{(Color online)

276: The analytical result of the clustering coefficient of GKE network,

277: $C(M)$, as a function of $M$ (line), in comparison with the

278: simulation (circles) results. Other parameters for the simulation

279: are $m=10$ and $N=2\times10^4$. Each data point is obtained as an

280: average of $100$ independent runs.}

281: \end{figure}

282:

283: \section{STRUCTURAL PERTURBATION}

284: \subsection{Structural transition}

285: We introduce a structural perturbation to the GKE model by modifying

286: step (1) of the definition as follow: Add a new node with $m$ links

287: to the network. With probability $p$, attach {\em one} of the new

288: node's links to a randomly selected {\em inactive} node. The other

289: links are then attached to nodes chosen randomly from the $M$ {\em

290: active} ones. We will show that the perturbation will lead to a

291: phase transition[24] from large- to small-world in the network

292: without changing the scale-free property.

293:

294: In GKE model, each node can be represented by the time step of its

295: generation. It is clear that, when $p=0$ the GKE network is

296: structured[18], i. e., the time ordering exists and the mean field

297: manner is absent[20,25]. We denote $l(t)$ as the average distance

298: for pairs of nodes separated by time interval $t$. Fig. 5 shows the

299: simulation results of the variation of $l(t)$ with perturbation

300: parameter $p$. It can be found that, when $p=0$, $l(t)$ increases

301: linearly with $t$, i. e., the time ordering indeed exists. Since the

302: nodes in the network are uniformly distributed on time axis, we can

303: easily obtain that the average distance $L$ is linearly correlated

304: to the system size $N$, i. e., $L\propto N$, which indicates the

305: absence of small world effect. However, once $p$ is a small finite

306: value, $l(t)$ becomes independent of time interval $t$, i. e., the

307: time ordering vanishes. Meanwhile, all nodes with the same degree

308: can be considered to be statistically equivalent, and the mean-field

309: manner is recovered.

310:

311: \begin{figure}

312: \scalebox{0.9}[0.9]{\includegraphics{fig5}} \caption{(Color online)

313: Illustration of $l(t)$ as a function of time interval $t$, with

314: perturbation $p=0.00$ (solid line), $0.01$ (dashed line), $0.05$

315: (dotted line), and $0.10$ (dot-dashed line). Other parameters for

316: the simulations are $m=3$, $M=10$, and $N=8000$. Each data point is

317: obtained as an average of $50$ independent runs.}

318: \end{figure}

319:

320: Let $d(i,j)$ denotes the distance between node $i$ and node $j$, and

321: thus the average distance of the model with system size $N$ is

322: \begin{eqnarray}L(N)=\frac{2\sigma(N)}{N(N-1)}.\end{eqnarray}

323: where the total distance is

324: \begin{eqnarray}\sigma(N)=\sum_{1\leq i<j\leq N } d(i,j).\end{eqnarray}

325: Intuitively, when a new node is added, the distance between old

326: nodes will not increase. Hence we have

327: \begin{eqnarray}\sigma(N+1)\leq\sigma(N)+\sum_{i=1}^N d(i,N+1),\end{eqnarray}

328: thus

329: \begin{eqnarray}\sigma(N+1)\leq\sigma(N)+\sum_{i=1}^N d(i,x)+N,\end{eqnarray}

330: where $x$ is the active node connected to the newly added one. Since

331: $p$ is nonzero, by using mean-field approximation[26,27], we have

332: \begin{eqnarray}\sum_{i=1}^N d(i,x)\approx L(N)(N-1).\end{eqnarray}

333: Thus, the inequality (20) reduces to

334: \begin{eqnarray}\sigma(N+1)\leq\sigma(N)+\frac{2\sigma(N)}{N}+N.\end{eqnarray}

335: Rewriting (22) in continuous form will yield

336: \begin{eqnarray}\frac{d\sigma(N)}{dN}\leq\frac{2\sigma(N)}{N}+N,\end{eqnarray}

337: which leads to

338: \begin{eqnarray}\sigma(N)\leq N^2\ln N+B,\end{eqnarray}

339: where $B$ is a constant. As $\sigma(N)\sim N^2L(N)$ and $N$ is

340: sufficiently large, we obtain $L(N)\leq\ln N$, i. e., the increasing

341: tendency of $L(N)$ is not faster than $\ln N$, which predicts the

342: presence of small-world property.

343:

344: In fact, the GKE network is similar to a chain of dense clusters

345: locally connected, i. e., it is like a regular lattice in

346: topological view. For this peculiar topology, all of the links in

347: the network are local. When a perturbation is introduced, the

348: network undergoes a cross-over from structured network to

349: unstructured network. Actually, the perturbation just means that,

350: with a probability, every node rewires one of its local links to a

351: randomly selected node, which is precisely the definition of the

352: model proposed by Watts and Strogatz[12]. That is to say, the

353: cross-over is just the small-world phase transition[24].

354:

355: In Fig. 6, we show the dependence of average distance $L$ on system

356: size $N$ with $p=0.00$ and $p=0.01$ in GKE network. For $p=0.00$,

357: the average distance grows linearly $L\propto N$, the same behavior

358: observed in one-dimensional regular lattices. Once $p$ is a small

359: finite value, $L$ becomes logarithmic related to $N$, i. e.,

360: $L\propto \ln N$. The logarithmic increase of average distance with

361: system size predicts that the phase transition from large- to

362: small-world occurs, which is in agreement with the analytical

363: result.

364:

365: \begin{figure}

366: \scalebox{0.9}[0.9]{\includegraphics{fig6}} \caption{(Color online)

367: Illustration of the average distance $L$ as a function of $N$, with

368: $p=0.00$ (squares) and $p=0.01$ (circles). When a perturbation

369: $p=0.01$ is introduced, $L$ grows logarithmically with $N$. The

370: values can be fitted well by a straight line, which is typical of

371: the small world effect. Other parameters for these simulations are

372: $m=4$ and $M=10$. Each data point is obtained as an average of $100$

373: independent runs. }

374: \end{figure}

375:

376: \

377:

378: It should be noted that, although we introduce a structural

379: perturbation into the network, the scale-free property is not

380: affected and the power-law exponent $\gamma=3$ is maintained.

381: Numerical simulations shown in Fig. 7 confirm this feature.

382:

383: \begin{figure}

384: \scalebox{0.9}[0.9]{\includegraphics{fig7}} \caption{(Color online)

385: Illustration of the cumulative degree distribution of the GKE

386: network with perturbation $p=0.00$ (squares), $0.01$ (upward

387: triangles), and $0.10$ (circles). The fitted power-law exponent is

388: $\gamma-1=1.97\pm0.02$. Other parameters for these simulations are

389: $m=10$, $M=20$, and $N=2\times10^4$. Each data point is obtained as

390: an average of $100$ independent runs. The dashed line has slope

391: $-2.0$ for comparison.}

392: \end{figure}

393:

394: \subsection{Universal scaling behavior of clustering coefficient}

395: In the following subsection, we investigate the dependence of

396: clustering coefficient $C$ on perturbation parameter $p$. Analogous

397: to the derivation of clustering coefficient in GKE network without

398: perturbation, we give an approximately analytical result. According

399: to the modification of the model, when a new node $l$ with $m$ links

400: is added into the network, one of the links is attached to a

401: randomly selected inactive node $s$ with probability $p$. That is to

402: say, with probability $p$, one of the neighbors of $l$ is inactive.

403: Since the system size $N$ is large compared with $M$, we assume that

404: node $s$ is apart from the active nodes[28]. Thus, $m-1$ possible

405: links between neighbors of $l$ are missed. Furthermore, node $s$ is

406: always apart from the afterward added nodes that are connected to

407: node $l$, which causes another $k-m$ possible links missed. Thus we

408: have

409: \begin{eqnarray}C(l)&=&\frac{m-1\times p}{M}[1-\frac{(k_l-m+1)(k_l-m)}{k_l(k_l-1)}]\nonumber\\&-&p\frac{m-1\times p}{M}\frac{2(m-1)}{k_l(k_l-1)}\nonumber\\&-&p\frac{m-1\times p}{M}\frac{2(k_l-m)}{k_l(k_l-1)}.\end{eqnarray}

410: Similar to the derivation of Eq. (16), we have

411: \begin{eqnarray}C=\frac{m}{M}(\frac{5}{6}-\frac{7}{30m})-\frac{1}{M}(\frac{13}{6}-\frac{7}{30m})p+\mathcal{O}(p^2)\end{eqnarray}

412:

413: It is worthwhile to note that the scaling behavior $C\sim1/M$ is

414: conserved though there exist certain fluctuations in the network

415: which lead to a structure transition. That is to say, $C\sim1/M$ is

416: a universal scaling behavior of clustering coefficient induced by

417: deactivation mechanism. Fig. 8 shows the log-log plot of the

418: clustering coefficient $C$ versus $M$ with different perturbation

419: parameters obtained by simulations. We can see that the perturbation

420: has almost no effect on the scaling behavior of $C$, which agrees

421: well with the analytical result.

422:

423: From Eq. (26), we know that, when $p$ is sufficiently small, the

424: clustering coefficient $C$ has a linearly relation with $p$. Fig. 9

425: shows the simulation result of clustering coefficient $C$ as a

426: function of perturbation parameter $p$, with $m=4$ and $M=10$. The

427: slope found numerically is 0.223, slightly larger than the

428: analytical result $\frac{1}{M}(\frac{13}{6}-\frac{7}{30m})=0.21$.

429: The deviation is due to the approximation[28] used in the

430: theoretical derivation of $C$. It is clear that the node $s$ is not

431: always apart from all the active nodes, which actually causes less

432: than $m-1$ possible links missed between neighbors of $l$. Thus, we

433: can easily find the precise slope should be a little larger than

434: that obtained from Eq. (26).

435:

436:

437:

438: \begin{figure}

439: \scalebox{0.9}[0.9]{\includegraphics{fig8}} \caption{(Color online)

440: Illustration of the clustering coefficient $C$ as a function of

441: number of active nodes $M$, with perturbation parameter $p=0.00$

442: (squares), $p=0.01$ (upward triangles), $p=0.10$ (downward

443: triangles), $p=0.50$ (diamonds), and $p=1.00$ (circles). The average

444: fit slope for the simulations is $0.994$. Other parameters for these

445: simulations are $m=10$ and $N=2\times10^4$. Each data point is

446: obtained as an average of $100$ independent runs. The dashed line

447: has slope $-1.0$ for comparison.}

448: \end{figure}

449:

450: \begin{figure}

451: \scalebox{0.9}[0.9]{\includegraphics{fig9}} \caption{(Color online)

452: Illustration of the clustering coefficient $C$ as a function of

453: perturbation parameter $p$. The fit slope is $0.223$. Other

454: parameters for this simulation are $m=4$, $M=10$, and

455: $N=2\times10^4$. Each data point is obtained as an average of $100$

456: independent runs.}

457: \end{figure}

458:

459:

460:

461: \section{DISCUSSIONS AND CONCLUSIONS}

462: We first discuss the relationship between the numerical behavior of

463: clustering coefficient in the coauthor and paper citation networks

464: and our theoretical result. According to the model of Ref. [19],

465: each of the authors and papers is assigned a topic, and authors can

466: only cite, produce papers or coauthor with others in their own topic

467: area. It means that, by topics the whole network is divided into

468: many subnetworks which evolve separately and simultaneously. Each of

469: the subnetworks can be reduced to a GKE network and the number of

470: these GKE subnetworks is just the number of the topics denoted as

471: $n$. In each subnetwork, the number of \emph{active} authors who are

472: doing research or the number of \emph{active} papers that are likely

473: to be cited just corresponds to the number of active nodes in GKE

474: network, which is denoted as $M$. Since the whole network is divided

475: into $n$ subnetworks, we intuitively know that the number of

476: \emph{active} authors or papers in each subnetwork is inversely

477: proportional to the number of topics, i. e., $M\sim 1/n$. For each

478: subnetwork can be treated as a GKE network, incorporating with our

479: theoretical result $C\sim 1/M$, we can easily obtained that $C\sim

480: n$. Since each subnetwork evolves parallelly, the clustering

481: coefficient of the whole network has the same behavior that it is

482: linearly correlated with the number of topics. Therefore, by using

483: our theoretical result we can indicate that the numerical behavior

484: of clustering coefficient in the coauthor and paper citation network

485: presented in Ref. [19] is due to the deactivation mechanism.

486: Furthermore, in the above discussion, we reduce the aging mechanism

487: to deactivation mechanism. In fact, in the model of Ref. [19], the

488: aging effect is introduced by an aging function. To this point, we

489: conjecture that there might be similar scaling behaviors of $C$ in

490: networks generated by other forms of aging mechanism.

491:

492: Finally, it is worthwhile to point out that, to our knowledge, no

493: empirical data are available to illustrate the theoretical scaling

494: behavior of clustering coefficient. Nevertheless, this interesting

495: property is due to the deactivation process which is a special case

496: of aging effect. In network evolution, aging is a universal

497: mechanism. Therefore, this simple theoretical result of $C$ will

498: have a rich practical significance and potential applications in

499: future network research. Meanwhile, such scaling behavior of $C$

500: should be given further considerations from empirical

501: investigations.

502:

503: \

504:

505: In summary, motivated by the aging effect governing the evolution of

506: the coauthor and paper citation networks, a generalized deactivation

507: model of network called GKE is presented in this paper. We study

508: analytically and by simulations several topological features of this

509: model, such as the degree distribution and clustering coefficient.

510: Most importantly, an interesting scaling behavior of the clustering

511: coefficient $C\sim 1/M$ is obtained, which shows that the numerical

512: result recently observed in the coauthor and paper citation networks

513: is due to deactivation mechanism. By introducing a perturbation, the

514: GKE network undergoes a small-world phase transition, while the

515: scaling behavior of $C$ is conserved. It indicates that $C\sim 1/M$

516: is a universal scaling behavior of clustering coefficient induced by

517: deactivation mechanism. In addition, we would like to emphasize that

518: our study unifies the concept of regular lattice, small-world graphs

519: and scale-free networks in a single model, and the GKE model

520: generalizes the new class of the networks with a crucial parameter

521: $M$.

522:

523: Since the GKE networks present peculiar structure property, it will

524: be interesting to investigate the effect of their complex topology

525: features on the network dynamics[25,29-31]. Especially, the

526: clustering coefficient of GKE network is precisely tunable by

527: parameter $M$ or $p$ without changing the degree distribution.

528: Therefore, the model can be used to quantitatively study the effect

529: of clustering on network synchronization[32-34] and network

530: epidemics[31,35]. Research along this line is in progress.

531:

532: \section*{ACKNOWLEDGMENTS}

533: Shi thanks the program for New Century Excellent Talents in

534: University of China for financial support (NECT-04-0510). This work

535: is also supported by National Natural Science Foundation of China

536: under Grant Nos. 70471084 and 10372045.

537:

538: \begin{thebibliography} {1}

539: \bibitem{1}

540: R. Albert and A.-L. Barab\'asi, Rev. Mod. Phys. \textbf{74}, 47

541: (2002).

542: \bibitem{2}

543: S. N. Dorogrovtsev and J. F. F. Mendels, Adv. Phys. \textbf{51},

544: 1079 (2002).

545: \bibitem{3}

546: M. E. J. Newman, SIAM Rev. \textbf{45}, 167 (2003).

547: \bibitem{4}

548: M. Faloutsos, P. Faloutsos and C. Faloutsos, Computer Communications

549: Review \textbf{29}, 251 (1999).

550: \bibitem{5}

551: R. Pastor-Satorras, A. Vazquez, and A. Vespignani, Phys. Rev. Lett.

552: \textbf{87}, 258701 (2001)

553: \bibitem{6}

554: B. A. Huberman, \emph{The Laws of the web} (MIT Press, Cambridge,

555: 2001).

556: \bibitem{7}

557: R. Albert, H. Jeong and A. -L. Barab\'asi, Nature \textbf{401}, 130

558: (1999).

559: \bibitem{8}

560: H. Jeong, S. P. Mason, A.-L. Barab\'asi, and Z. N. Oltvai, Nature

561: \textbf{411}, 41 (2001).

562: \bibitem{9}

563: E. Ravasz, A. L. Somera, D. A. Mongru, Z. N. Oltvai, and A.-L.

564: Barab\'asi, Science \textbf{297}, 1551 (2002).

565: \bibitem{10}

566: S. L. Pimm, \emph{Food Webs} (University of Chicago Press, Chicago,

567: 2002).

568: \bibitem{11}

569: S. Wasserman and K. Faust, \emph{Social Network Analysis} (Cambridge

570: University Press, Cambridge, 1994); J. Scott, \emph{Socical Network

571: Analysis: A Handbook} (Sage Publications, London, 2000).

572: \bibitem{12}

573: D. J. Watts and S. H. Strogatz, Nature \textbf{393}, 440 (1998).

574: \bibitem{13}

575: A. -L. Barab\'asi and R. Albert, Science \textbf{286}, 509 (1999).

576: \bibitem{14}

577: L. A. N. Amaral, A. Scala, M. Barth\'el\'emy, and H. E. Stanley,

578: Proc. Natl. Acad. Sci. U.S.A. \textbf{97}, 11 149 (2000).

579: \bibitem{15}

580: S. N. Dorogovtsev and J. F. F. Mendes, Phys. Rev. E \textbf{62},

581: 1842 (2000).

582: \bibitem{16}

583: A. F. J. Van Raan, Scientometrics \textbf{47}, 347 (2000);

584: \bibitem{17}

585: H. Zhu, X. Wang, and J.-Y. Zhu, Phys. Rev. E \textbf{68}, 056121

586: (2003); K. B. Hajra and P. Sen, Phys. Rev. E \textbf{70}, 056103

587: (2004); K. B. Hajra and P. Sen, Physica A \textbf{346}, 44 (2005);

588: P.-Q. Jiang, B.-H. Wang, T. Zhou, Y.-D. Jin, Z.-Q. Fu, P.-L. Zhou,

589: and X.-S. Luo, Chin. Phys. Lett. \textbf{22}, 1285 (2005).

590: \bibitem{18}

591: K. Klemm and V. M. Egu\'iluz, Phys. Rev. E \textbf{65}, 036123

592: (2002).

593: \bibitem{19}

594: K. B\"orner, J. T. Maru, and R. L. Goldstone, Proc. Natl. Acad. Sci.

595: U.S.A. \textbf{101}, 5266 (2004).

596: \bibitem{20}

597: A. V\'azquez, M. Bogu\~n\'a, Y. Moreno, R. Pastor-Satorras, and A.

598: Vespignani, Phys. Rev. E \textbf{67}, 046111 (2003).

599: \bibitem{21}

600: C.-P. Zhu, S.-J. Xiong, Y.-J. Tian, N. Li, and K.-S. Jiang, Phys.

601: Rev. Lett. \textbf{92}, 218702 (2004).

602: \bibitem{22}

603: It means that each node is connected symmetrically to its $2z$

604: nearest neighbors. For detailed definition, see B. Bollob\'{a}s,

605: \emph{Random Graphs} (Academic Press, New York, 1985). We set $z=1$

606: in the simulations in this paper for the sake that the influence of

607: initial graph on the evolving of the network is minimal and can be

608: ignored.

609: \bibitem{23}

610: K. Klemm and V. M. Egu\'iluz, Phys. Rev. E \textbf{65}, 057102

611: (2002).

612: \bibitem{24}

613: M. E. J. Newman and D. J. Watts, Phys. Rev. E \textbf{60},

614: 7332(1999).

615: \bibitem{25}

616: V. M. Egu\'iluz, E. Hern\'{a}ndez-Garc\'{i}a, O. Piro, and K. Klemm,

617: Phys. Rev. E \textbf{68}, 055102(R) (2003).

618: \bibitem{26}

619: T. Zhou, G. Yan, and B.-H. Wang, Phys. Rev. E \textbf{71}, 046141

620: (2005).

621: \bibitem{27}

622: Z.-M. Gu, T. Zhou, B.-H. Wang, G. Yan, C.-P. Zhu, and Z.-Q. Fu,

623: arXiv: cond-mat/0505175.

624: \bibitem{28}

625: As we know, when $p=0$, the topology of the GKE network is similar

626: to a one-dimensional lattice. Thus, if $p$ is sufficient small, this

627: approximation is reasonable.

628: \bibitem{29}

629: K. Klemm, V. M. Egu\'iluz, R. Toral, and M. SanMiguel, Phys. Rev. E

630: \textbf{67}, 026120 (2003)

631: \bibitem{30}

632: K. Suchecki, V. M. Egu\'iluz, and M. S. Miguel, Phys. Rev. E

633: \textbf{72}, 036132 (2005)

634: \bibitem{31}

635: V. M. Egu\'iluz and K. Klemm, Phys. Rev. Lett. \textbf{89}, 108701

636: (2002)

637: \bibitem{32}

638: P. N. McGraw, M. Menzinger, Phys. Rev. E \textbf{72}, 015101(R)

639: (2005).

640: \bibitem{33}

641: M. Zhao, T. Zhou, B. -H. Wang, G. Yan, and H. -J. Yang, preprint

642: arXiv: cond-mat/0510332.

643: \bibitem{34}

644: X. Wu, B. -H. Wang, T. Zhou, W. -X. Wang, M. Zhao, and H. -J. Yang,

645: Chin. Phys. Lett. 23, 1046 (2006).

646: \bibitem{35}

647: M. Bogu\~n\'a, R. Pastor-Satorras, and A. Vespignani, Phys. Rev.

648: Lett. \textbf{90}, 028701 (2003).

649:

650:

651:

652:

653:

654:

655:

656: \end{thebibliography}

657:

658:

659:

660: \end{document}

661: