
\documentclass[aps,pre,twocolumn,floatfix,nofootinbib,showpacs]{revtex4}
%\documentclass[pre,preprint,showcase,floatfix]{revtex4}
\usepackage[dvips]{graphicx}
\usepackage{epsf,amssymb,amsmath,mathptmx}
\begin{document}

\title{Bidding process in online auctions and winning strategy:
rate equation approach}
\author{I.~Yang and B.~Kahng}
\affiliation{School of Physics and Center for Theoretical Physics,
Seoul National University, Seoul 151-747, Korea}
\date{\today}

\begin{abstract}
Online auctions have expanded rapidly over the last decade and have
become a fascinating new type of business or commercial transaction
in this digital era. Here we introduce a master equation for the
bidding process that takes place in online auctions.
We find that the number of distinct bidders who have bid $k$ times
up to the $t$-th bid, called $k$-frequent bidders, scales
as $n_k(t)\sim tk^{-2.4}$.
The rate at which a $k$-frequent bidder places a successfully
transmitted bid is obtained as $q_k(t) \sim k^{-1.4}$, independent of
$t$ for large $t$. This theoretical prediction is in agreement with
empirical data. These results imply that bidding at the
last moment is a rational and effective strategy to win in an eBay
auction.
\end{abstract}
\pacs{89.75.-k, 89.75.Da, 89.65.-s, 89.65.Gh}
\maketitle

Electronic commerce (e-commerce) refers to any type of
business or commercial transaction that involves information
transfer across the Internet. As a form of e-commerce,
the online auction, i.e., an auction conducted via the Internet~\cite{heck},
has expanded rapidly over the last decade and has become a
fascinating new type of business or commercial transaction in
this digital era. Online auctions have several advantages
over traditional auctions. Traditional auctions require
the simultaneous participation of all bidders or agents at the
same location; these limitations do not exist in online auction
systems. Owing to this convenience, ``eBay.com,'' the largest
online auction site, boasts over 40 million registered consumers
and has experienced rapid revenue growth in recent years.

Interestingly, the activities arising in online auctions generated
by individual agents proceed in a self-organized
manner~\cite{mantegna,bouchard,stanley,challet,pennock,hulst}. For
example, the total number of bids placed on a single item or
category and the bid frequency of each agent follow power-law
distributions~\cite{yang}. These power-law
behaviors~\cite{simon,zhang,albert} are rooted in the fact that an agent who
has bid frequently up to a certain time is more likely to bid
in the next time interval. This pattern is theoretically analogous
to the process often referred to as preferential attachment,
which is responsible for the emergence of scaling in complex
networks~\cite{ba}, and is reminiscent of the mechanism that
generates the Zipf law~\cite{zhang,pareto}. The accumulated data of
the detailed bidding process enable us to characterize
the dynamic process quantitatively. In this paper, we describe a master equation
for the bidding process.
The master-equation approach is useful for capturing the dynamics
of the online bidding process because it takes into account the
openness and the non-equilibrium nature of the auction.
This model is in contrast to the existing equilibrium approach~\cite{lb,kauf},
in which there is a fixed number of bidders. The equilibrium approach
is relevant to traditional auctions; however, it is unrealistic
to apply this approach to Internet auctions. The power-law
behavior of the bidding frequency of individual agents
can be reproduced from the master equation.
Moreover, we consider the probability that an agent
who has bid $k$ times, called the $k$-frequent
bidder, becomes the final winner. We conclude that the winner is
likely to be one who bids at the last moment but who placed
few bids in the past.

Our study is based on empirical data collected from two
different sources~\cite{yang}. The first dataset was downloaded
from the web, http://www.eBay.com, and is composed of all the auctions
that closed in a single day. The data include 264,073 auctioned items,
grouped into 194 subcategories. The dataset allows us
to identify 384,058 distinct agents via their unique user IDs. To
verify the validity of our findings in different markets and time
spans, the second dataset was accumulated over a period of one year from
eBay's Korean partner, auction.co.kr. This dataset comprises 215,852 agents
who bid on 287,018 articles in 355 lowest-level categories.

An auction is a public sale in which property or items of merchandise
are sold to the bidder who proposes the highest price. Typically,
online auction companies adopt the English auction format, in
which an article or item is initially offered at a low price that is
progressively raised until a transaction is made. Both ``eBay.com''
and ``auction.co.kr'' adopt this rule, and many bidders submit multiple
bids in the course of the auction. An agent is not allowed to place
two or more bids in direct succession. It is important to note
that an eBay auction has a fixed end time: it typically ends a
week after the auction begins, at the same time of day to
the second. The winner is the last agent to bid within this period.
In an auction with such a fixed deadline, a bid placed
very close to the deadline does not give other bidders sufficient
time to respond.
In this case, a sniper, i.e., a last-moment bidder, might win the
auction, while the bid that follows has a substantial probability
of not being transmitted successfully. While such a bidding pattern is
well known empirically, no quantitative analysis of it has been
performed as yet. In this study we analyze this issue through
the rate equation approach.

To characterize the dynamic process, we first introduce several
quantities for each item or article as follows:
\begin{itemize}
\item[(i)] When a bid is successfully transmitted, time $t$
increases by one.
\item[(ii)] Terminal time $T$ is the time at which an
auction ends. Thus, the index of bids runs from $i=1$ to $T$.
\item[(iii)] $N(t)$ is the number of distinct bidders who
successfully bid at least once up to time $t$.
Thus, the index of bidders (or agents) runs from $i=1$ to $N(t)$.
\item[(iv)] $k_i(t)$ is the number of successful
bids transmitted by agent $i$ up to time $t$.
\item[(v)] $n_k(t)$ is the number of bidders with
frequency $k$ up to time $t$.
\end{itemize}
From the above, we obtain the relations
\begin{equation}
N(t)=\sum_k n_k(t)
\end{equation}
and
\begin{equation}
t=\sum_{k} k n_{k}(t)
\end{equation}
for any time $t$ including the terminal time $T$.
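
As a quick consistency check of these two relations, consider a
hypothetical auction (an illustration only, not taken from the data) in
which three agents A, B, and C bid in the order A, B, A, C, A. No agent
bids twice in direct succession, and $t=5$ after the fifth bid. The
frequencies are $k_{\rm A}(5)=3$ and $k_{\rm B}(5)=k_{\rm C}(5)=1$, so
that $n_1(5)=2$, $n_3(5)=1$, and all other $n_k(5)$ vanish. Then
\begin{eqnarray*}
N(5)&=&\sum_k n_k(5)=2+1=3,\\
\sum_k k\,n_k(5)&=&1\cdot 2+3\cdot 1=5=t,
\end{eqnarray*}
as the two relations require.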

It is found numerically that $T$ is linearly proportional to $N(T)$,
that is, $T \approx a N(T)$. The average value of the
proportionality coefficient $a$ for different items or articles listed
on eBay is estimated to be $a \approx 1$ when the total number
of bidders $N(T)$ exceeds $20$. However, when the number of
bidders is lower, the proportionality coefficient is very large, as
shown in Fig.~\ref{fig:K_N_each}. For the Korean auction, $a
\approx 4.5$, regardless of the number of bidders. On the other
hand, the total number of bids $T$ and the number of bidders $N$ for each
article are not uniform. Their distributions, denoted as $P_f (T)$
and $P_n (N)$, follow the exponential functions $P_f (T)\sim
\exp(-T/T_c)$ and $P_n(N)\sim \exp(-N/N_c)$, where
$T_c\approx 7.4$ and $10.8$ for the eBay and Korean auctions,
respectively, and $N_c\approx 2.5$ and $5.6$ for the eBay and
Korean auctions, respectively (Fig.~\ref{fig2}).

%%%%%%%%%FIGURE
\begin{figure}[t]
\centerline{\epsfxsize=8cm \epsfbox{fig1.eps}}
\caption{Plot of $T$ versus $N(T)$ for eBay.com (a) and the
Korean auction (b). The dotted line has a slope of 1 in (a)
and 4.5 in (b).} \label{fig:K_N_each}
\end{figure}
%%%%%%%%%%%%%%%%%%%

%%%%%%%%%FIGURE
\begin{figure}
\centerline{\epsfxsize=8cm \epsfbox{fig2.eps}}
\caption{Plot of $P_f(T)$ versus $T$ in (a) and (c), and $P_n(N)$
versus $N$ in (b) and (d), for the eBay [(a) and (b)] and Korean
[(c) and (d)] auctions
on a semi-logarithmic scale. The dotted lines have slopes of $2.5$ in
(a), $5.6$ in (b), $7.4$ in (c), and $10.8$ in (d).} \label{fig2}
\end{figure}
%%%%%%%%%%%%%%%%%%%

%%%%%%%%%FIGURE
\begin{figure}
\centerline{\epsfxsize=8cm \epsfbox{fig3.eps}}
\caption{Plot of $\langle dk/dt \rangle$ versus $k/t$ for the eBay (a)
and the Korean (b) auctions. The dotted lines, obtained
by least-squares fits in the ranges [0.1:1] (a) and
[0.01:1] (b), follow $\langle dk/dt \rangle \approx 0.7k/t$.}
\label{fig:w_k_t}
\end{figure}
%%%%%%%%%%%%%%

%%%%%%\section{Stochastic process}
We introduce the master equation for the bidding process as
\begin{equation}
n_k(t+1)-n_k(t)=w_{k-1}(t)n_{k-1}(t)-w_k(t)n_k(t)+\delta_{k,1}u_t,
\label{discrete}
\end{equation}
where $w_k(t)$ is the transition probability that a bidder, who
has bid $k-1$ times up to time $t-2$, bids at time $t$.
In this case, the total bid frequency of that agent up to time $t$
becomes $k$.
Note that a bidder is not allowed to bid successively. In the
master equation, we presume that the bidding pattern is similar
over different items when $N(T)$ is sufficiently large. Then,
${w_k}(t)$ may be written as ${w_k}(t) \approx \langle dk/dt
\rangle$ on average over different items. Empirically, we find
that
\begin{equation}
{w_k}(t)\approx \langle dk/dt \rangle \approx bk/t, \label{kernel}
\end{equation}
where $b$ is estimated to be $b\approx 0.7$ for both the eBay and
Korean auctions (Fig.~\ref{fig:w_k_t}). The fact that $w_k
\propto k$ is reminiscent of the preferential attachment rule in
growing models of complex networks~\cite{ba}. The quantity $u_t$ is the
probability that a new bidder makes a bid at time $t$. Using the
property that $\sum_k n_k (t)=N(t)$, we obtain
\begin{equation}
u_t=N(t+1)-N(t).
\end{equation}
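
The last relation can be checked directly by summing
Eq.~(\ref{discrete}) over $k$. As a sketch, assume the conventions
$n_0(t)=0$ (no bidder has frequency zero) and $w_k(t)n_k(t)\to 0$ for
$k$ beyond the largest frequency present; neither convention is stated
explicitly in the text. The transition terms then telescope,
\begin{eqnarray*}
N(t+1)-N(t)&=&\sum_{k\ge 1}\big[n_k(t+1)-n_k(t)\big]\\
&=&\sum_{k\ge 1}\big[w_{k-1}(t)n_{k-1}(t)-w_k(t)n_k(t)\big]+u_t\\
&=&w_0(t)n_0(t)+u_t=u_t,
\end{eqnarray*}
so that transitions between frequencies conserve the number of bidders
and only the entry of a new bidder changes $N(t)$.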

Next, we change the discrete equation, Eq.~(\ref{discrete}), to
a continuous equation as follows:
\begin{equation}
\frac {\partial n_k(t)}{\partial t}=-\frac {\partial}{\partial
k}\big({w_k}(t)n_k(t)\big)+\delta_{k,1} u_t, \label{continuous}
\end{equation}
which, using Eq.~(\ref{kernel}), can be rewritten as
\begin{equation}
\frac {\partial n_k(t)}{\partial t}=-\frac{b}{t}\frac
{\partial}{\partial k}\big(k n_k(t)\big)+\delta_{k,1} u_t.
\label{continuous2}
\end{equation}
When $k > 1 $, we use the method of separation of variables,
$n_k(t)=I(k)T(t)$ (here $T(t)$ denotes the time-dependent factor, not
the terminal time), thus obtaining
\begin{equation}
\frac{\partial}{\partial k}(k I(k))+\ell I(k)=0, \label{eq_K}
\end{equation}
where $\ell$ is a constant of separation, and
\begin{equation}
\frac{\partial T(t)}{\partial t}= \frac {b\ell}{t}T(t).
\end{equation}
Thus, we obtain
\begin{equation}
n_k(t)\sim t^{b\ell} k^{-(1+\ell)}.
\end{equation}
When $k=1$,
\begin{eqnarray}
\frac{\partial n_1(t)}{\partial t}=-\frac{b}{t}n_1(t)
+u_t.
\label{eq:n_1}
\end{eqnarray}
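
The separation-of-variables step for $k>1$ can be spelled out as
follows. Substituting $n_k(t)=I(k)T(t)$ into Eq.~(\ref{continuous2})
without the source term and dividing both sides by $bI(k)T(t)/t$ gives
\begin{eqnarray*}
\frac{t}{b\,T(t)}\frac{\partial T(t)}{\partial t}
&=&-\frac{1}{I(k)}\frac{\partial}{\partial k}\big(kI(k)\big)=\ell .
\end{eqnarray*}
The left-hand side depends only on $t$ and the right-hand side only on
$k$, so both must equal a constant, here called $\ell$. The $k$ part
reads $k\,\partial I/\partial k=-(1+\ell)I$, which is solved by
$I(k)\propto k^{-(1+\ell)}$, and the $t$ part is solved by
$T(t)\propto t^{b\ell}$; their product is the power-law form quoted
above.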

Next, from the fact that $N=\sum_k n_k$, we obtain
\begin{eqnarray*}
\frac{\partial N}{\partial t} &=&
\sum_{k > 1}\frac{\partial n_k}{\partial t}+\frac{\partial n_1}{\partial t}\\
&=&-\sum_{k>1}\frac{b}{t}\frac{\partial}{\partial k}\Big(kn_k\Big)
-\frac{b}{t}n_1+\frac{\partial N}{\partial t}\\
&=& \frac{b\ell}{t}(N-n_1)-\frac{b}{t}n_1+\frac{\partial N}{\partial t},
\end{eqnarray*}
where we used $u_t=\partial N/\partial t$ in the continuum limit and
Eq.~(\ref{eq_K}).
Therefore, we obtain $N(t)=(1+1/\ell)n_1(t)$ and $n_1(t)\sim
t^{b\ell}$ by using Eq.~(\ref{eq:n_1}). From the empirical data in
Fig.~\ref{fig:N_t_fig}, we find that $\ell b\approx 1$, i.e., $N(t)$
grows linearly with $t$. Note that $N(t) < t$; the linear relationship
holds asymptotically and breaks down for small $t$. Since
$b\approx 0.7$ in Fig.~\ref{fig:w_k_t}, we obtain $\ell
\approx 1/b\approx 1.4$. Therefore,
\begin{equation}
n_k(t)\sim t k^{-2.4} \label{n_k}
\end{equation}
for large $t$, which agrees reasonably well with the empirical data
shown in Fig.~\ref{fig:N_DNST}.
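
The intermediate steps leading to Eq.~(\ref{n_k}) can be made explicit.
Cancelling $\partial N/\partial t$ on both sides of the last line of the
derivation above leaves $\ell(N-n_1)=n_1$, i.e., $N=(1+1/\ell)n_1$.
Assuming $u_t=\partial N/\partial t$ in the continuum limit,
Eq.~(\ref{eq:n_1}) becomes
\begin{eqnarray*}
\frac{\partial n_1}{\partial t}&=&-\frac{b}{t}n_1
+\Big(1+\frac{1}{\ell}\Big)\frac{\partial n_1}{\partial t}
\quad\Longrightarrow\quad
\frac{\partial n_1}{\partial t}=\frac{b\ell}{t}n_1,
\end{eqnarray*}
whose solution is $n_1(t)\sim t^{b\ell}$. With $b\approx 0.7$ and
$b\ell\approx 1$, one has $\ell\approx 1/0.7\approx 1.43$, so the
exponent of $k$ is $1+\ell\approx 2.43$, quoted above as $2.4$.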

%%%%%%%%%FIGURE
\begin{figure}
\centerline{\epsfxsize=8cm \epsfbox{fig4.eps}}
\caption{Plot of $\langle N(t) \rangle$ versus $t$, averaged
over different items for the eBay data. The straight line has a
slope of $0.7$, obtained from a least-squares fit.} \label{fig:N_t_fig}
\end{figure}
%%%%%%%%%%%%%%%%%%%

%%%%%%%%%FIGURE
\begin{figure}[b]
\centerline{\epsfxsize=8cm \epsfbox{fig5.eps}}
\caption{Plot of $n_k$ versus $k$ for the eBay auction
at ebay.com (a) and for the Korean auction at
auction.co.kr (b) for various terminal times $T$. The solid lines,
drawn for guidance, have a slope of $-2.4$.}
\label{fig:N_DNST}
\end{figure}
%%%%%%%%%%%%%%%%%%%

In eBay auctions, the winner is the last bidder in the bidding
sequence. We now trace the bidding activity of the winner in the
bidding sequence in order to find the winning strategy. To
proceed, let us define $q_{k}(t+1)$ as the probability that a
bidder, who has bid $k-1$ times up to time $t-1$, bids successfully
at time $t+1$. Note that a bidder is not allowed to bid
successively. In this case, $q_k(T)$ is simply the probability that a
$k$-frequent bidder becomes the final winner. The probability
$q_k(t+1)$ satisfies the relation
\begin{eqnarray}
q_{k}(t+1)&=&
(1-u_{t+1})\sum_{j=1}^{N(t)}q_{j}(t)\frac{(k-1)(n_{k-1}(t)-
\delta_{j,k-1})}{t-j}\nonumber \\
&+&\delta_{k,1}u_{t+1}\label{eq:win}
\end{eqnarray}
with the boundary conditions $q_{1}(1)=1$ and $q_{1}(2)=1$. The
first term on the right-hand side of Eq.~(\ref{eq:win}) is
composed of three factors: (i) $1-u_{t+1}$ is the
probability that one of the existing bidders bids successfully at
time $t+1$, (ii) $q_j(t)$ is the probability that the bid at
time $t$ is placed by the $j$-frequent bidder, and (iii) the last
factor is derived from the bidding rate, Eq.~(\ref{kernel}), where the
contribution of the bidder at time $t$ is excluded because he/she
is not allowed to bid at time $t+1$. The second term represents
the addition of a new bidder at time $t+1$.

The rate equation, Eq.~(\ref{eq:win}), can be solved recursively.
To proceed, we simplify Eq.~(\ref{eq:win}) by assuming that
$n_{k-1}(t)$ is significantly larger than $\delta_{j,k-1}$, which is
valid when the number of bidders is large. Then,
\begin{widetext}
\begin{eqnarray}
q_k(t+1)& \approx & (1-u_{t+1})
\sum_{i=1}^{N(t)}q_i(t)\frac{(k-1)n_{k-1}(t)}
{t-i}+\delta_{k,1}u_{t+1}\nonumber \\
&=& (k-1)n_{k-1}(t)\prod_{\tau=2}^{t} (1-u_{\tau+1})
\Big[ \sum_{i=1}^{\tau-1} \frac {(i-1)n_{i-1}(\tau)}{(\tau-i)} \Big] q_{1}(2) \\
&+&(1-u_{t+1})(k-1) n_{k-1}(t)\sum_{\tau=3}^{t}  \frac{
u_{\tau}}{\tau-1}\prod_{\tau'=\tau+1}^{t}(1-u_{\tau'}) \Big[
\sum_{i=1}^{\tau^{\prime}-1} \frac
{(i-1)n_{i-1}(\tau^{\prime})}{(\tau^{\prime}-i)}\Big]+
u_{t+1}\delta_{k,1}.\nonumber
\end{eqnarray}
\end{widetext}
Since $1-u_t\approx 0.3 < 1$, $q_k(t)$ is obtained to be
\begin{equation}
q_k(t)\approx
(1-u_{t})\frac{(k-1)n_{k-1}(t-1)}{t-2}+\delta_{k,1}u_t
\end{equation}
within the leading order. Considering that $n_k(t)\sim
tk^{-2.4}$ in Eq.~(\ref{n_k}) and that $u_t$ is constant, we obtain
$q_k(t)\sim (t-1)k^{-1.4}/(t-2)$ for large $k$ and $t$,
with a weak dependence on $t$. Thus, the winning probability
of the $k$-frequent bidder is simply given as
\begin{equation}
q_k(T)\sim k^{-1.4}
\end{equation}
in the limit $T\to \infty$. This result is confirmed by the
empirical data in Fig.~\ref{fig:winning}.
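
The step from Eq.~(\ref{n_k}) to the $k^{-1.4}$ law can be written out
as follows, as a sketch for $k>1$ (where the $\delta_{k,1}$ term drops
out) and treating $u_t$ as a constant $u$. Inserting
$n_{k-1}(t-1)\sim (t-1)(k-1)^{-2.4}$ into the leading-order expression
for $q_k(t)$ gives
\begin{eqnarray*}
q_k(t)&\sim&(1-u)\,\frac{(k-1)\,(t-1)\,(k-1)^{-2.4}}{t-2}
=(1-u)\,\frac{t-1}{t-2}\,(k-1)^{-1.4},
\end{eqnarray*}
where $(t-1)/(t-2)\to 1$ for large $t$ and
$(k-1)^{-1.4}\approx k^{-1.4}$ for large $k$. As an illustration of what
this scaling implies, doubling the bid frequency from $k$ to $2k$
reduces the asymptotic winning probability by a factor
$2^{-1.4}\approx 0.38$.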

%%%%%%%%%FIGURE
\begin{figure}
\centerline{\epsfxsize=8cm \epsfbox{fig6.eps}}
\caption{Plot of the winning probability of the $k$-frequent bidder
relative to that of the one-frequent bidder, $q_k(T)/q_1(T)$, at
the last moment versus frequency $k$.
The dotted line, drawn for guidance, has a slope of $-1.4$.}
\label{fig:winning}
\end{figure}
%%%%%%%%%%%%%%%%%%%

Our analysis explicitly shows that the winning strategy is to bid
at the last moment as the first attempt rather than to bid
incrementally from the start. This result is consistent
with the empirical finding of Roth and Ockenfels~\cite{roth} for
eBay. According to them, the bidders who have won the most items tend
to wait until the last minute to submit bids, although there is some
probability of bids not being successfully transmitted.
As evidence, they studied 240 eBay auctions and found that 89
bids were submitted in the last minute and
29 in the last ten seconds. Our result supports these empirical
findings.

In conclusion, we have analyzed the statistical properties of
emerging patterns created by a large number of agents based on the
empirical data collected from eBay.com and auction.co.kr.
The number of bidders and the winning probability
decay as power laws of the bid frequency $k$, $n_k(t)\sim tk^{-2.4}$
and $q_k(t)\sim k^{-1.4}$, respectively, which
has been confirmed by empirical data.\\

This work is supported by the KRF Grant No. R14-2002-059-01000-0
in the ABRL program funded by the Korean government MOEHRD and
the CNS research fellowship in SNU (BK).

\begin{thebibliography}{99}
\bibitem{heck} E. van Heck and P. Vervest, Communications of the ACM
{\bf 41}, 99 (1998).
\bibitem{mantegna} R.N. Mantegna and H.E. Stanley,
\textit{An introduction to econophysics:
Correlations and complexity in finance} (Cambridge University Press,
Cambridge, 2000).
\bibitem{bouchard} J.-P. Bouchaud and M. Potters,
\textit{Theory of financial risks:
From statistical physics to risk management} (Cambridge University Press,
Cambridge, 2000).
\bibitem{stanley} M.H.R. Stanley, L.A.N. Amaral, S.V. Buldyrev, S. Havlin,
H. Leschhorn, P. Maass, M.A. Salinger, and H.E. Stanley,
{Nature} {\bf 379}, 804 (1996).
\bibitem{challet} D. Challet and Y.-C. Zhang, {Physica A}
{\bf 246}, 407 (1997).
\bibitem{pennock} D.M. Pennock, S. Lawrence, C.L. Giles, and F.A. Nielsen,
{Science} {\bf 291}, 987 (2001).
\bibitem{hulst} R. D'Hulst and G.J. Rodgers, {Physica A} {\bf 294}, 447 (2001).
\bibitem{yang} I. Yang, H. Jeong, B. Kahng, and A.-L. Barab\'asi,
Phys. Rev. E {\bf 68}, 016102 (2003).
\bibitem{simon} H.A. Simon, {Biometrika} {\bf 42}, 425 (1955).
\bibitem{zhang} M. Marsili and Y.-C. Zhang, {Phys. Rev. Lett.} {\bf 80},
2741 (1998).
\bibitem{albert} R. Albert and A.-L. Barab\'asi, {Rev. Mod. Phys.} {\bf 74}, 47
(2002).
\bibitem{ba} A.-L. Barab\'asi and R. Albert, {Science} {\bf 286}, 509 (1999).
\bibitem{pareto} V. Pareto, \textit{Cours d'\'economie politique} (Rouge,
Lausanne et Paris, 1897).
\bibitem{lb} Y. Shoham and M. Tennenholtz,
Games and Economic Behavior {\bf 35}, 197 (2001).
\bibitem{kauf} R.J. Kauffman and C.A. Wood, {Proc. of ICIS} (2000).
\bibitem{roth} A.E. Roth and A. Ockenfels, {American Economic Review}
{\bf 92}, 1093 (2002).
\end{thebibliography}

\end{document}

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%%%%%%%%END OF DOCUMENT%%%%%%%%%
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%