1: \documentstyle [12pt]{article}

2: %\documentstyle [aaai]{article}

3: %\documentstyle [ijcai01]{article}

4: %\parskip\medskipamount

5: \newcommand {\ignore} [1] {}

6: \newtheorem{theorem}{Theorem}[section]

7: \newtheorem{lemma}[theorem]{Lemma}

8: \newtheorem{corollary}[theorem]{Corollary}

9: \newtheorem{definition}{Definition}[section]

10: \newtheorem{proposition}[theorem]{Proposition}

11: \newtheorem{claim}[theorem]{Claim}

12: \newtheorem{observation}[theorem]{Observation}

13: \newenvironment{proof}{\noindent{\bf Proof:\/}}{\hfill $\Box$\vskip 0.1in}

14: \newenvironment{proofsp}{\noindent{\bf Proof}}{\hfill $\Box$\vskip 0.1in}

15: \def \AA{{\cal A}}

16: \def \GG{{\cal G}}

17: \def \HH{{\cal H}}

18: \def \TT{{\cal T}}

19: \def \PP{{\cal P}}

20: %\input{epsf}

21:

22: \begin{document}

23:

24: \title{Rational Competitive Analysis}

25:

26: \author{Moshe Tennenholtz

27: \thanks{Permanent address: Faculty of Industrial Engineering and Management,

28: Technion--Israel Institute of Technology, Haifa 32000, Israel}\\

29:  Computer Science Department\\

30:  Stanford University \\

31:   Stanford, CA 94305}

32: \date{}

33: \maketitle

34:

35: \begin{abstract}

36: Much work in computer science has adopted competitive analysis as a

37: tool for decision making under uncertainty. In this work we extend

38:  competitive analysis to the context of multi-agent systems. Unlike

39: classical competitive analysis where the behavior of an agent's environment

40: is taken to be arbitrary, we consider the case where an agent's environment

41: consists of other agents. These agents will usually obey some (minimal)

42: rationality constraints. This leads to the definition of {\em rational

43: competitive analysis}. We introduce the concept of rational competitive

44: analysis, and initiate the study of competitive analysis for

45: multi-agent systems. We also discuss the application of rational

46: competitive analysis

47:  to the context of bidding games, as well as to the classical one-way

48: trading problem.

49: \end{abstract}

50:

51:

52: \section{Introduction}

53:

54: Competitive analysis is a  central tool for the design and analysis

55: of algorithms and protocols for decision making under uncertainty

56: \cite{BorElYa}.

57: % Competitive analysis is an approach to decision making

58: %in face of uncertainty.

59: It is a well studied and widely applicable approach that fits the framework

60: of qualitative decision-making in AI (see e.g.

61: \cite{DT97,BraTenjour}).

62: %,tan-pearl:1994a,doyle-wellman:1994a}).

63: %BTJACM99,Lehmann96,

64: The competitive analysis approach attempts to minimize the ratio

65: between the payoff an  agent obtains and the payoff

66:  he could have obtained had

67: he known the behavior of the environment. For example, consider the

68: following trading problem (see \cite{BorElYa}, Chapter 14).

69: An agent who holds \$100 may wish to exchange it for British pounds. At each point in time, e.g.\ every minute between 8AM and 4PM,

70: an exchange ratio

71:  of dollars

72: and pounds is announced. This ratio changes dynamically, and in

73: an unpredictable

74: manner.

75: The agent needs to choose the time at which he will trade his

76: \$100 for pounds.

77: Notice that if

78: the agent had known the sequence of exchange rates, $e$,

79: then  he could have chosen a strategy $o(e)$ that maximizes his

80: payoff. If the agent chooses a  strategy $s$, then we

81: can compute the ratio of the payoffs obtained by $o(e)$ and $s$.

82: We can do similarly for every sequence of exchange rates $e'$.

83: Based on this, we can compute

84: the highest (i.e. worst) ratio, over all possible sequences, that might

85: be obtained when we compare optimal strategies to $s$. This ratio is

86: denoted by $R(s)$.  According to the competitive analysis approach,

87: the agent will apply the {\em competitive ratio} decision criterion:

88:  he will choose

89:   a strategy $s$ for which $R(s)$ is

90: minimal. This decision criterion may be quite helpful when we

91: lack probabilistic  assumptions about the environment. For

92: example, assume that the minimal value of $R(\cdot)$, which

93:  is obtained by  some

94: strategy $s$, is 2. Then, by selecting $s$,

95:  the agent guarantees himself a payoff which is at

96: least half of the optimal payoff that he could have obtained had

97: he known the actual environment behavior.

98:

99: The competitive ratio also has an additive variant (also termed

100: minimax regret \cite{Milnor}), where we replace the term ``ratio''

101: by the term ``difference'' in the definition of $R$. So, in our

102: example, if a strategy $s$ that minimizes $R(\cdot)$ obtains a

103: competitive difference/regret of 20,

104: then this implies that by performing $s$ the agent

105: gets a payoff (e.g. worth in British pounds) which is at most 20

106: lower than what he  could have obtained had he known the behavior of the

107: environment. In the sequel,  we will use the additive version of

108: the competitive ratio decision criterion.
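
\noindent{\bf Illustration.} The following numbers are hypothetical and
serve only to make the two criteria concrete. Suppose the agent has just two
strategies, $s$ and $s'$, and that only two exchange-rate sequences, $e$ and
$e'$, are possible, with payoffs (in pounds)
\[
\begin{array}{c|cc}
     & e  & e' \\ \hline
s    & 60 & 50 \\
s'   & 80 & 40
\end{array}
\]
The optimal payoffs are then $80$ under $e$ and $50$ under $e'$. In the ratio
version, $R(s)=\max(80/60,\,50/50)=4/3$ and $R(s')=\max(80/80,\,50/40)=5/4$.
In the additive version, $R(s)=\max(80-60,\,50-50)=20$ and
$R(s')=\max(80-80,\,50-40)=10$. In this small example both criteria select
$s'$; in general the two versions need not agree.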

109:

110:  Competitive analysis

111: % based on the competitive ratio decision

112: % rule,

113:  has been applied to a variety of classical problems

114:  in computer science,

115:  such as the k-server problem \cite{KoutsoupiasPapadimitriou} and

116: paging \cite{FiatKarpLuby}, as well as to

117: more  general algorithmic

118: problems \cite{BorodinLinialSaks}.

119:  In all of these studies the environment that the agent acts in is

120:  non-strategic,  and therefore is not assumed to follow any ``rational''

121:  behavior. In this paper we extend the concept of competitive

122:  analysis to the context of multi-agent systems. In a multi-agent system

123:  the

124:  environment in which an agent takes his decision consists of

125:  other ``rational'' agents. Following

126: previous work on competitive analysis, our

127:  approach is non-Bayesian and normative;

128:  we would like to find a decision rule for

129:  the agent that will rely  as little as possible on assumptions about

130:  the behavior of his environment. Therefore, we adopt the requirement

131:  that agent $A$ should

132:  rule out a behavior $b_1$ of agent $B$ only if $b_1$ is dominated

133:  by another behavior, $b_2$, of that agent.

134:  Dominated behaviors are purely irrational in any decision

135:  making model. The agent will choose his behavior according to the

136:  competitive ratio decision rule. However, he should consider only

137:  rational behaviors of the other agents; a behavior of an

138: agent will be considered

139:  irrational if and only if it is dominated by another behavior of that agent.

140:

141:

142:  In section 2 we describe bidding games, a family of games that

143:  will serve us for the illustration of the basic concepts developed

144:  in this paper. Bidding games are representatives of k-price

145:  auctions, a central class of economic mechanisms \cite{kpricegeb}.

146:  In section 3 we present a competitive analysis of bidding games.

147:  In section 4 we introduce rational competitive analysis, a new

148:  tool for normative decision making, that generalizes competitive

149:  analysis to the context of rational environments, and apply it

150: to bidding games.

151: In section 5 we consider repeated (multi-stage) games. We present several

152:  results on the relationships

153:  between (rational) competitive analysis

154: of repeated games and the competitive analysis

155: of the particular (one-shot) games they consist of.

156: Then, in section 6,  we discuss and study  variants and modifications

157:  of the one-way trading problem, using rational competitive

158: analysis.  In particular, we

159:  study the multi-agent one-way trading problem.

160:

161: \section{Bidding games}

162:

163: We start by recalling the general definition of a

164: (strategic form) game.

165:

166: \begin{definition}

167: A {\em game} is a tuple $G=\langle

168: N=\{1,2,\ldots,n\},S=\{S_i\}_{i=1}^n, \{U_i\}_{i=1}^n \rangle$,

169: where $N$ is a set of $n \geq 2$ players, $S_i$ is the set of

170: strategies available to player $i$, and $U_i: \Pi_{j=1}^n S_j

171: \rightarrow R$ is the utility function of player $i$.

172: \end{definition}

173:

174: In a game, each player selects a strategy from a set of available strategies.

175: The tuple of strategies selected, one by each player,  determines

176:  the payoff of each of the agents%

177: \footnote{Here and elsewhere we use the terms player and agent

178: interchangeably.} (as prescribed by the utility functions).

179:

180: In a bidding game, a center attempts to obtain a service from a set of

181: potential suppliers. Each such supplier has a certain cost associated

182: with that service. This cost is taken to be an integer between $K-T$

183: and $K$, where $K$ and $T$ are w.l.o.g integers, $K>T > 0$.  Each agent

184: will offer his service and ask for a payment in the range between

185: $K-T$ and $K$. We will associate the request for payment of

186:  $K-i$ with the

187: integer $i$, where $0 \leq i \leq T$.  The center will choose

188: as the service provider the supplier

189:  with the lowest asking price.  There are various ways

190: for determining the payment to that agent; in particular, the agent can be

191: paid his asking price, the second lowest asking price,

192: or the third lowest

193: asking price. We assume  that the costs for providing the service by

194: each of the agents are common knowledge among them, although

195: the center might not be familiar with these costs.

196:  Although this is quite natural

197: for the above procurement problem, other assumptions can be treated

198: similarly.

199:

200: Our definition of bidding games will capture

201: the above, by considering

202: a fully isomorphic situation, namely: the

203: auctioning of a good.

204: The good is held by the center.

205: Each agent has a valuation (i.e. maximal willingness to pay) for the good.

206:  Each agent needs to decide on his bid. The center will

207: allocate the good to the agent with the highest bid (rather than

208: to the agent with the lowest asking price,

209: as in the isomorphic procurement problem).

210:

211:

212: Formally, we have:

213:

214:

215: \begin{definition}

216: Given a set of $n$ players, and an integer $T \gg 1$,

217:  a bidding game is determined by

218:  the tuple

219: $B=(x_1,\ldots,x_n,k)$, where $x_i={l_i \over T}$ for some integer

220: $0 \leq l_i \leq T$, and $1 \leq k \leq n$ is an integer.

221: Player $i$'s strategy in $B$ is a decision about $b_i\in [0,T]$.

222: Given a strategy profile $b=(b_1,b_2,\ldots,b_n)$, denote by $b_{[i]}$

223: the $i$-th highest element of this tuple (so $b_{[1]}$ is the highest bid).

224: Let $M(b)$ be the number of elements of $b$ that equal $b_{[1]}$.

225: Then, $U_i(b)= {1 \over M(b)} (x_i - {b_{[k]} \over T})$ if

226: $b_i=b_{[1]}$, and $U_i(b)=0$ otherwise.

227: \end{definition}

228:

229:

230:

231: In the above formalism, $x_i$ is the valuation of agent $i$

232: (that is normalized to the interval $[0,1]$), while $b_i$ denotes

233: the bid made by agent $i$.

234: The

235: payment made by the winner

236: is determined by the parameter $k$. If $k=1$ we get the

237: standard high-bid wins (or first-price) auction; if $k=2$ then we

238: get the famous Vickrey (second-price) auction, while if $k=3$ we

239: get the case of third-price auctions.%

240: \footnote{Third-price auctions have been shown to have

241: appealing properties in the context of Internet Auctions

242: \cite{kpricegeb}.}
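
\noindent{\bf Illustration.} As a sketch with hypothetical numbers (not
taken from any result in this paper), and reading $b_{[i]}$ as the $i$-th
highest bid, let $n=3$, $T=20$ and $(l_1,l_2,l_3)=(8,5,3)$, so that
$(x_1,x_2,x_3)=(0.4,0.25,0.15)$. For the bids $b=(6,4,2)$ we have
$b_{[1]}=6$, $b_{[2]}=4$, $b_{[3]}=2$ and $M(b)=1$, so agent 1 wins and
\[
U_1(b) = x_1 - {b_{[k]} \over T} =
\left\{
\begin{array}{ll}
(8-6)/20 = 0.1 & \mbox{if } k=1, \\
(8-4)/20 = 0.2 & \mbox{if } k=2, \\
(8-2)/20 = 0.3 & \mbox{if } k=3,
\end{array}
\right.
\]
while $U_2(b)=U_3(b)=0$. If instead $b=(6,6,2)$ and $k=1$, then $M(b)=2$,
and the definition gives
$U_1(b)={1 \over 2}\cdot(8-6)/20=0.05$ and
$U_2(b)={1 \over 2}\cdot(5-6)/20=-0.025$: an agent who ties for the highest
bid at a price above his valuation obtains a negative utility.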

243:

244:

245:  For ease of presentation we will assume that $ 2 \leq l_i <  T$

246: for every $1 \leq i \leq n$, that $i \neq j$ implies $l_i \neq

247: l_j$, and that $T \geq n$.

248:

249: \section{Competitive Analysis}

250:

251: In a game, agent $i$  is facing an environment that consists of

252: the other agents. The actions to be selected by these agents are

253: not under the control of $i$. Following the literature on

254: competitive analysis, the competitive ratio decision rule

255: may be used in order

256: to choose an action for that agent.

257:

258: \noindent{\bf Definition 3.1}

259: %\begin{definition}

260: %

261: {\em

262: Given a game $G$, and a strategy profile $s=(s_1,s_2,\ldots,s_n)

263: \in \Pi_{j=1}^n S_j$, the regret of player $i$ is given by

264: $Reg_i(s_i,s_{-i})=\max_{t \in S_i}

265: U_i(s_1,\ldots,s_{i-1},t,s_{i+1},\ldots,s_n) - U_i(s)$. A

266: strategy $s \in S_i$ is a competitive strategy for agent

267: $i$ if $s \in \arg\min_{t

268: \in S_i} \max_{q \in S_{-i}} Reg_i(t,q)$, where $S_{-i}$ denotes

269: the possible strategy profiles of players in $N \setminus \{i\}$.}
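
\noindent{\bf Illustration.} A minimal sketch with a hypothetical
two-player game (the strategy names and payoffs are assumptions made only
for this example): player 1 chooses between $a$ and $b$, player 2 chooses
between $c$ and $d$, and
\[
U_1(a,c)=4, \quad U_1(a,d)=0, \quad U_1(b,c)=3, \quad U_1(b,d)=2.
\]
Then
\[
\begin{array}{ll}
Reg_1(a,c)=\max(4,3)-4=0, & Reg_1(a,d)=\max(0,2)-0=2, \\
Reg_1(b,c)=\max(4,3)-3=1, & Reg_1(b,d)=\max(0,2)-2=0.
\end{array}
\]
Hence $\max_q Reg_1(a,q)=2$ and $\max_q Reg_1(b,q)=1$, so $b$ is the
competitive strategy for player 1 in this game: it guarantees a regret of
at most 1 whatever player 2 does.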

270: %\end{definition}

271:

272: Given the above definition we are interested in applying

273: competitive analysis to bidding games. We now present three

274: claims about competitive analysis of  bidding games.

275: These claims are associated with the competitive analysis of

276:  1st-, 2nd-, and 3rd-price auctions, respectively.

277:

278: \begin{claim} Given the bidding game

279: $B=(x_1,\ldots,x_n,1)$, a competitive strategy for agent $i$ yields

280: a regret value of $\alpha \over T$, where $\alpha$

281: equals the upper integer value (ceiling) of ${{l_i-1} \over {2}}$, i.e.\ $\alpha=\lceil (l_i-1)/2 \rceil$.

282:

283: \end{claim}

284:

285: \noindent{Basic idea behind proof:} Agent $i$ can lose by

286: submitting a bid that is higher than his valuation. On the other

287: hand, by submitting a bid that is below ${l_i}-1$ agent $i$ might

288: lose, since agent $j \neq i$ might submit $l_i-1$ as a  winning bid.

289:  Since agents may submit the bid 0,

290: agent $i$ will minimize his regret by submitting a bid

291: that equals (the upper integer value of) half of the difference

292: between  $l_i-1$ and 0.

293:

294: \begin{claim} Given the bidding game $B=(x_1,\ldots,x_n,2)$, a competitive strategy

295: for agent $i$ yields a zero

296: regret.

297:

298: \end{claim}

299:

300: \noindent{Basic idea behind proof:} Here the optimal strategy for

301: an agent, regardless of what the others do, is to send his actual

302: valuation as his bid; this is a well known property of the Vickrey

303: auction \cite{Wolfstetter}. As a result we get a regret of 0.

304:

305: \begin{claim} Given the bidding game

306: $B=(x_1,\ldots,x_n,3)$, and assuming w.l.o.g.\ that $x_1 > x_2 > \cdots > x_n$,

307: agent $j$'s competitive strategy is to submit the bid

308: $\min(2l_j,T)$.

309:

310:

311: \end{claim}

312:

313: \noindent{Basic idea behind proof:} Given that agents may submit

314: the bid 0, agent $j$ might reach a regret of ${{l_j} \over T}$ if he is not

315: the winner. Submitting however a bid that is higher than $2l_j$

316: may also lead to a regret of ${{l_j} \over T}$, given that the agents may

317: submit $2l_j$ as their bids. Combining these

318: observations, we

319: get that submitting the bid $\min(2l_j,T)$ is the

320: competitive strategy.

321:

322:

323:

324: \section{Rational Competitive Analysis}

325:

326: Although competitive analysis is a most powerful concept from a

327: non-Bayesian

328: normative perspective, it may be quite restrictive when we consider

329: decision-making in multi-agent systems. Following the spirit of competitive

330: analysis for normative decision making,

331: we refrain from using probabilistic  assumptions and

332:  game-theoretic equilibrium analysis.%

333: \footnote{The debate about whether competitive ratio and non-Bayesian

334: decision making are expressive or useful for normative or descriptive

335: objectives is beyond the scope of this paper; see \cite{BTJACM99} for a sound and

336: complete

337: axiomatization of the competitive ratio decision criterion.}

338:  However, one can

339: still improve on the use of competitive analysis by considering

340: minimal rationality requirements.

341:

342:

343: %\begin{definition}

344: \noindent{\bf Definition 4.1}

345: {\em

346: Given a game $G=\langle N=\{1,2,\ldots,n\},\{S_i\}_{i=1}^n,\{U_i\}_{i=1}^n

347: \rangle$, we say that a strategy $s_i \in S_i$ weakly dominates a strategy

348: $s_i^{'} \in S_i$ if $U_i(s_i,t) \geq

349: U_i(s_i^{'},t)$ for every strategy profile $t$ of the players in

350: $N \setminus \{i\}$, and there exists such a strategy profile $t'$ for which

351: $U_i(s_i,t') >

352: U_i(s_i^{'},t')$. A strategy $s \in S_i$ will be called rational if there

353: is no other strategy $\bar{s} \in S_i$ that weakly dominates it.

354: Given a game $G$, the set of rational strategies for player $i$ will

355: be denoted by $Rat(S_i)$.}

356: %\end{definition}

357:

358: In any reasonable model agents will choose only from the set of

359: non-dominated strategies. Our idea is therefore to combine the

360: powerful idea of competitive analysis and this minimal

361: requirement of rationality, in order to re-introduce competitive

362: analysis into the framework of multi-agent systems.

363:

364: %\begin{definition}

365: \noindent{\bf Definition 4.2}

366: {\em

367: A strategy $s \in S_i$ is a rationally competitive strategy if $s

368: \in \arg\min_{t \in S_i} \max_{q \in Rat(S_{-i})} Reg_i(t,q)$, where

369: $Rat(S_{-i})$ denotes the possible rational strategy profiles of

370: players in $N \setminus \{i\}$, i.e. each player $j \in N

371: \setminus \{i \}$ chooses its strategy from $Rat(S_j)$.}
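
\noindent{\bf Illustration.} A minimal sketch with a hypothetical
two-player game (all names and payoffs are assumptions made only for this
example): player 1 chooses between $a$ and $b$, player 2 chooses between
$c$ and $d$, and the utilities, written as $U_i(s_1,s_2)$, are
\[
\begin{array}{llll}
U_1(a,c)=4, & U_1(a,d)=0, & U_1(b,c)=3, & U_1(b,d)=2, \\
U_2(a,c)=1, & U_2(a,d)=0, & U_2(b,c)=2, & U_2(b,d)=2.
\end{array}
\]
For player 2, $c$ weakly dominates $d$, since $U_2(a,c) > U_2(a,d)$ and
$U_2(b,c) \geq U_2(b,d)$; hence $Rat(S_2)=\{c\}$. The regrets of player 1
are $Reg_1(a,c)=0$, $Reg_1(a,d)=2$, $Reg_1(b,c)=1$ and $Reg_1(b,d)=0$.
Taking the maximum over all of $S_2$ gives $2$ for $a$ and $1$ for $b$, so
$b$ is the competitive strategy. Taking the maximum over $Rat(S_2)=\{c\}$
only gives $0$ for $a$ and $1$ for $b$, so $a$ is the rationally
competitive strategy, with zero regret. Ruling out the dominated strategy
$d$ thus changes the recommended choice.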

372:

373: %\end{definition}

374:

375: Basically, a rationally competitive strategy applies the

376: competitive ratio decision criterion, while taking into account only

377: rational activities of the environment. As the following

378: claims illustrate, rational competitive analysis introduces

379:  an improved approach to

380: normative decision making.

381:

382: \begin{claim}

383: Given the bidding game

384: $B=(x_1,\ldots,x_n,1)$, a rational competitive strategy for

385: agent $i$ yields a regret of $\alpha \over T$, where $\alpha$

386:  equals the upper integer

387: value of ${{\min(l_i,\max_{j \neq i} l_j)-2} \over {2}}$.

388: \end{claim}

389:

390: {\noindent Basic idea behind proof:} We observe that any strategy

391: that tells agent $j$ to submit a bid which is greater than or equal to

392: his valuation is dominated by the strategy of submitting his

393: valuation minus 1. Given our assumptions about the possible valuations,

394: all other strategies, excluding the strategy of

395: submitting the bid 0, are not dominated. As a result, from the

396: perspective of agent $i$, if his valuation is the highest one,  he

397: will minimize his regret if he makes a bid that is half of the

398: distance between

399: $\max_{j \neq i} l_j - 1$ and 1. If agent $i$'s valuation is not the

400: highest one then he will minimize his regret (again, taking into

401: account the assumptions on possible valuations) if he makes a bid

402: in between $l_i-1$ and 1.

403:

404: Notice that rational competitive analysis allows us to improve upon the

405: type of reasoning carried out in Claim 3.1. Technically,

406: in the case of a bidding game with $k=1$, rationality

407: implies that we need to take the minimum between $l_i$ and the highest other

408: agents' valuation in our analysis, rather than consider  $l_i$ only.
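
\noindent{\bf Illustration.} Taking the formulas of the two first-price
claims at face value, a hypothetical instance shows the size of the
improvement. Let $T=20$, $l_i=8$ and $\max_{j \neq i} l_j = 5$ (these
numbers are assumptions made only for this example). The bound of Claim 3.1
is
\[
\alpha = \left\lceil {{8-1} \over 2} \right\rceil = \lceil 3.5 \rceil = 4,
\qquad {\alpha \over T} = {4 \over 20} = 0.2,
\]
whereas the bound obtained under the rationality assumption is
\[
\alpha = \left\lceil {{\min(8,5)-2} \over 2} \right\rceil
       = \lceil 1.5 \rceil = 2,
\qquad {\alpha \over T} = {2 \over 20} = 0.1 .
\]
If the roles are reversed, $l_i=5$ and $\max_{j \neq i} l_j = 8$, both
formulas give $\alpha=2$, so in this instance the gain comes only from
knowing that the other agents' valuations lie below $l_i$.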

409:

410:

411:

412: \begin{claim}

413: Given the bidding game $B=(x_1,\ldots,x_n,2)$,

414:  a rational competitive strategy for agent $i$ yields a zero

415: regret.

416:

417: \end{claim}

418:

419:

420: As we can see, unlike the major effect of the rationality

421: assumption in the case of a first-price auction, there is no change

422: in the analysis in the case of a second-price auction. In the case of a

423: third-price auction, we see again the effect of the revised

424: notion:

425:

426: \begin{claim}

427: Given the bidding game $B=(x_1,\ldots,x_n,3)$, and assuming

428: w.l.o.g.\ that $x_1 > x_2 > \cdots > x_n$, a rational

429: competitive strategy for agent $j \ (j=1,2)$ is to submit the

430: bid $\min(2l_j-l_{[3]},T)$, where $l_{[3]}$ corresponds to the

431: 3rd highest valuation; a zero-regret rational competitive  strategy

432: for agent $i$, $3 \leq i \leq n$, is to submit $l_i$.

433:

434: \end{claim}

435:

436: \noindent{Basic idea behind proof:} First, observe that any

437: strategy where the agent submits a bid that is below that agent's

438: valuation is dominated by the strategy that tells him to submit

439: his actual valuation as his bid. As a result, for agents $3,4,\ldots,n$

440:  there is a 0 regret in

441: submitting their  actual valuations as their bids.

442:  Let us assume that agent $i$ (where $i$ is 1 or 2)

443: submits a bid, then it can lose ${{l_i-l_{[3]}} \over T}$ if it turns out not

444: to be the highest bidder (since another agent $j$ submits a higher bid).

445:  On the other hand, by submitting the bid $b_i > l_i$ a loss

446: of ${{b_i-l_i} \over T}$ may be caused, since (from the perspective of agent $i$)

447: two other agents may submit the bid $b_i$.

448:  This implies that the bid

449: $\min(2l_i-l_{[3]},T)$ will minimize this agent's regret.

450:

451:

452: As we can see, in the case of $k=3$ as well, rational competitive analysis

453: for bidding games leads to an improved normative approach to decision making.

454: In particular, the competitive strategy of Claim 3.3 specifies too high a bid,

455: and is not a rationally competitive strategy; as a result, it fails to

456: serve in a multi-agent context.
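
\noindent{\bf Illustration.} As a sketch with hypothetical numbers, let
$n=3$, $T=20$ and $(l_1,l_2,l_3)=(8,5,3)$, so that $l_{[3]}=3$. The
competitive bids of Claim 3.3 are
\[
\min(2 \cdot 8, 20) = 16 \mbox{ for agent 1}, \qquad
\min(2 \cdot 5, 20) = 10 \mbox{ for agent 2},
\]
while the rationally competitive bids for the third-price game are
\[
\min(2 \cdot 8 - 3, 20) = 13 \mbox{ for agent 1}, \qquad
\min(2 \cdot 5 - 3, 20) = 7 \mbox{ for agent 2},
\]
and agent 3 submits $l_3=3$. The bid $13$ balances the two losses
identified in the proof sketch for agent 1: the loss from not being the
highest bidder, ${{l_1-l_{[3]}} \over T} = {5 \over 20}$, equals the loss
from overbidding, ${{b_1-l_1} \over T} = {{13-8} \over 20} = {5 \over 20}$.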

457:

458: \section{Rational competitive analysis in repeated games: folk theorems}

459:

460: We first recall the notion of finitely repeated games \cite{FudTir}.

461:

462: \begin{definition}

463: Given an integer $l >0$ and a game $G=\langle

464: N=\{1,2,\ldots,n\},S=\{S_i\}_{i=1}^n, \{U_i\}_{i=1}^n \rangle$,

465: a {\em repeated game} $RG=(G,l)$ with respect to $G$ is a game where

466: $G$ is repeatedly played $l$ times. $RG$ consists of the

467: following strategies and utility functions:

468: a strategy of agent $i$ in $RG$ determines the strategy of $G$ to be taken

469: by $i$ in the $k$-th iteration of $G$, as a function of the history of

470: strategies of $G$

471: selected by the others in iterations $1,2,\ldots,k-1$.

472: Given a tuple of strategies of $RG$, one for each agent, the payoff

473: for agent $i$ is the sum of its payoffs along the $l$ iterations.

474: A sub-game of a repeated game $RG$ is a repeated game that starts from

475: iteration $ 1 \leq q \leq l$ of $RG$ and consists of $l-q+1$ iterations.

476: A (rationally) competitive strategy in $RG$ is a strategy

477: that is a (rationally) competitive strategy at each of the sub-games

478: of $RG$.%

479: \footnote{This definition is in the spirit of sub-game perfect equilibrium

480: in game theory.}

481: \end{definition}

482:

483:

484:

485: Repeated games have been of much interest in the game-theory literature,

486: because they make it possible to study agents' actions as a function

487: of past events and other agents' actions.

488: The study of repeated games is central to the

489:  understanding of

490: basic issues in coordination and cooperation (e.g. \cite{Axelrod}),  as well

491: as for the study of  learning in games (e.g. \cite{FudLevlearn}).

492:

493:

494: One of the central challenges for  the study of repeated games is to

495: establish general theorems (titled folk-theorems)

496: that explain/recommend  behavior in

497: these (repeated) games by means of solution concepts for

498: the games they consist of. In our case, it would be of interest

499: to understand what will be a rationally competitive strategy in a

500: repeated game, and try to relate it to the competitive analysis

501: of the simple one-shot game that takes place at each iteration.

502:

503: We now present a general result about competitive analysis in

504: repeated games. For ease of presentation we will assume that $G$

505: is a two-player game, where all payoffs are distinct. We will also

506: assume w.l.o.g that  all payoffs are non-negative.  Given a

507: repeated game $(G,l)$, let us denote the highest payoff for agent

508: $i$ in $G$ by $h_i(G)$, and the second highest payoff of agent

509: $i$ in $G$ by $sh_i(G)$.

510:

511:

512: \begin{theorem}

513:

514: Given a repeated game $(G,l)$, assume that for each agent $i$

515:  $h_i(G) \geq 2 \cdot sh_i(G)$. Then

516: a rationally competitive strategy for agent $i$

517: in the game $(G,l)$ is obtained

518:  by performing the competitive strategy of it in $G$ on iterations

519:  $1,2,\ldots,l-1$ and performing the rational competitive strategy of

520: it in $G$ on the

521: last iteration.

522: \end{theorem}

523:

524: \noindent{Basic idea behind proof:}

525: %Consider the decision-making process

526: %by agent $j$, and let us denote the other agent by $i$.

527: From the

528: perspective of agent $i$, assuming we are at stage $k < l$, the

529: selection of any strategy $s$ of $G$ by $j$ can be complemented to

530: a non-dominated strategy of $j$; this non-dominated strategy will tell

531: $j$ to choose the strategy associated with $h_j(G)$ in stages

532: $k+1,\ldots,l$.  The reason

533: that the resulting strategy  is not dominated is that $j$

534: considers the strategy where $i$ will also choose in stages

535: $k+1,\ldots,l$ the  strategy (of his) in $G$ that corresponds to $h_j(G)$,

536: and does it only if in stage $k$ agent $j$ chooses $s$; in addition,

537: according to this strategy $i$ will choose the strategy that corresponds

538: to $sh_j(G)$ in stage $k$.

539: This

540: implies that agent $i$ should consider at stages $1,2,\ldots,l-1$

541: all possible strategies of agent $j$ in $G$. In the last stage

542: agent $i$ is no longer subject to the above considerations and

543: will choose the rationally competitive strategy of $G$.

544:

545:

546: The above theorem shows a strong connection between competitive analysis

547: in repeated games and competitive analysis in simple single-shot games.

548: As it turns out, this connection can be

549: further generalized to a much richer context:

550:

551: \begin{definition} Let $\bar{G}=(G_1,G_2,\ldots,G_m)$

552: be a sequence of games

553: where $N$ is the set of players in each of the games in the sequence,

554:  and game $G_i$ is played in iteration $i$.

555: The strategy of agent $t$ in $\bar{G}$ determines its strategy in

556: $G_i$, $1\leq i \leq m$,

557: as a function of the strategies of  $G_j$, $1 \leq j < i$, selected

558:  by the other

559: agents in previous iterations.

560: Given a tuple of strategies of $\bar{G}$, one for each agent, the

561: payoff of agent $i$ is taken as the sum of its payoffs in the $m$

562: iterations.

563:

564: \end{definition}

565:

566:

567:

568: \begin{theorem}

569:

570: Given a sequence of games $\bar{G}=(G_1,G_2,\ldots,G_m)$ where

571: $N$ is the set of players in each of the games in the sequence,

572:  and game $G_j$ is played in iteration $j$,

573: and assume that

574:  $h_i(G_k) \geq 2 \cdot sh_i(G_l)$ for every $1 \leq k,l \leq m$,

575: and for every agent $i$, then

576: a rationally competitive strategy for agent $i$

577: in the game $\bar{G}$ is obtained

578:  by performing the competitive strategy of $G_j$ in each iteration

579:  $j=1,2,\ldots,m-1$ and performing the rational competitive strategy of

580: $G_m$ in the

581: last iteration.

582: \end{theorem}

583:

584: The above theorem can be generalized into a situation where $n$

585: games from among the set of games $\{G_1,G_2,\ldots,G_m\}$ are

586: executed in some random order (with possible repetitions).

587: Formally, this can be captured by the following definition and

588: theorem:

589: %\begin{definition}

590:

591: \noindent{\bf Definition 5.3}

592: {\em

593: Given a set of games $G=\{G_1,G_2,\ldots,G_m\}$, a random game

594: with respect to $G$,  $\bar{G}$,  is a sequence of $n$ games

595: $(g_1,g_2,\ldots,g_n)$, where $g_i \in G \ (1 \leq i \leq n)$

596: and $N$ is the set of players in each of the games in the

597: sequence. The game to be played in iteration $i$, $g_i$, is

598: randomly selected from the set $G$ independently of previous

599: selections made. The strategy of agent $t$ in $\bar{G}$ determines

600: its strategy in  $g_i$, $1\leq i \leq n$, as a function of the

601: strategies of $g_j$, $1 \leq j < i$, selected  by the other agents in

602: previous iterations. Given a tuple of strategies of $\bar{G}$,

603: one for each agent, the payoff of agent $t$ is taken as the sum

604: of its payoffs in the $n$ iterations.

605: A sub-game of a random game $\bar{G}$ with respect to $G$, is

606: a random game with respect to $G$ that starts from iteration $1 \leq j \leq n$

607: and consists of $n-j+1$ iterations as above.

608: A (rationally) competitive strategy in a random game is required to be

609: a (rationally) competitive strategy at each sub-game of it.

610: }

611: %\end{definition}

612:

613: \begin{theorem}

614:

615: Given a random game $\bar{G}$ with  respect to

616: $G=\{G_1,G_2,\ldots,G_m\}$, and assume that

617:  $h_i(G_k) \geq 2 \cdot sh_i(G_l)$ for every $1 \leq k,l \leq m$,

618: and for every agent $i$, then

619: a rationally competitive strategy for agent $i$ in the game

620: $\bar{G}$ is obtained

621:  by performing the competitive strategy of game $g_j$ in each

622:  iteration

623:  $j=1,2,\ldots,n-1$ and performing the rational competitive strategy

624:  of the game $g_n$ on the last iteration in the sequence.

625: \end{theorem}

626:

627: \section{One-way trading in multi-agent systems}

628:

629: In the previous section we have discussed competitive analysis

630: for multi-agent systems in the framework of general repeated games

631: and random games. In this section we look at a particular variant

632: of repeated games that extends a well known and fundamental framework

633: for competitive analysis -- the one-way trading (see citations

634: in chapter 14 of \cite{BorElYa}).

635:

636: One way to present the structure of one-way trading is as

637: follows. An agent $a$ seeks to buy $X$ units of a good or of a

638: service. A supplier $A$ wishes to supply these units of good to

639: $a$. The agents act in an environment that determines the actual

640: payment for a unit of good in a non-deterministic way. For example,

641: the payments might be specified in dollars, but since agent $A$ is

642: a British company the actual payoffs it will obtain for providing

643: the good  will depend on

644: the exchange ratio of the dollar and the

645: British pound. Formally, the environment

646: announces at each point in time, $1,2,\ldots,t$,  the payoff that

647: will be obtained by agent $A$ for supplying a  unit of good. The

648: announcements are selected in an unpredictable, non-deterministic

649: manner from the interval $[m,M]$, where $M > m

650: > 0$. For example,  when  $K$ is announced at point $i$,

651: agent $A$ can supply the $X$ units of good and obtain

652: a payoff of $X \cdot K$. Our assumption is that agent $A$ will

653: obtain a zero payoff by not providing the units of good.

654:  The

655: decision problem that agent $A$ faces is as follows: at each point

656: he needs to decide whether he would like to supply the

657: units of good at the current rate. We assume that when agent $A$

658: is willing to provide the service then he will provide and be paid

659: for the whole quantity of goods requested  by agent $a$ (this property

660: is termed one-way search).

661:

662: The competitive analysis approach tells agent $A$ in the above

663: scenario to minimize his regret value. As it turns out, the

664: competitive strategy in this case will tell the agent to accept

665: the offer (i.e., supply the units of good) when the payoff

666: reaches ${{M+m} \over 2}$ in stage $j\leq t-1$, and to accept

667: the offer on stage $t$ otherwise.

668: % (if we consider the

669: %multiplicative competitive ratio the term ${{M-m} \over 2}$

670: %should be replaced by $\sqrt{M \over m}$ \cite{}).

671:
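To see how such an acceptance rule arises, here is a brief sketch under the additive-regret reading of the scenario; the symbols $Y$ and $p^*$ are introduced only for this illustration. Suppose the current announcement is $Y$ at some stage before $t$. Accepting risks that a later announcement equals $M$, while passing risks that all later announcements equal $m$. Per unit of good:
\[
\mbox{regret(accept)} \leq M - Y, \qquad \mbox{regret(pass)} \leq Y - m .
\]
The two bounds balance at
\[
M - p^* = p^* - m \quad \Longleftrightarrow \quad p^* = {{M+m} \over 2},
\]
so accepting once the announcement reaches $p^*$ keeps the worst-case regret at ${{M-m} \over 2}$ per unit.
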

672: One-way trading is a typical setting for the use of competitive

673: analysis. We now extend it to the case of several agents, where

674: more than one agent may wish to supply the units of good

675: requested by $a$. We will first develop the multi-agent

676: framework without

677: considering the rationality assumption, and then will extend it to

678: the case of rational competitive analysis.

679:

680: \subsection{Multi-agent one-way trading}

681:

682: For ease of exposition we consider the case of two trading

683: agents (i.e. two suppliers who can provide the units of good

684: requested by $a$): $A_1$ and $A_2$. The payment offers for the

685: two agents are taken to be independent. For example, agent 1 may be a

686: British company and agent 2 may be a Japanese company, and

687: therefore the actual payment offers for them (from their

688: perspective) will reflect the exchange rate between the dollar

689: and the British pound, and the exchange rate between the dollar and

690: the Japanese Yen, respectively. Formally, we have:

691:

692: \begin{definition}

693: Let $M_1,M_2,m_1,m_2,t,X=2K$ be positive

694: integers, where $M_1 > m_1$ and $M_2

695: > m_2$, $t \geq 3$, and $X$ is even. A multi-agent one-way trading $T=\langle

696: N=\{1,2\},X,t,m_1,M_1,m_2,M_2  \rangle$ is a random game with the

697: following players, strategies,  and payoffs:

698:

699: \begin{enumerate}

700: \item

701: The players are 1 and 2.

702: \item There are $t$ iterations. Each iteration $i$ is associated with

703: a pair of numbers $(a_1,a_2)$ where $m_1 \leq a_1 \leq M_1$ and

704: $m_2 \leq a_2 \leq M_2$. At each iteration each agent can ``take''

705: or ``pass''. However, if an agent takes in iteration $i$ then both

706: agents can only ``pass'' in all iterations $j > i$.

707: \item The payoff of agent $k$ in iteration $i$ is 0 if it passes;

708: if agent $k$ performs ``take'' in iteration $i$ then its payoff will

709: be $a_k X$ if the other agent passes and $a_k K$ if the other

710: agent takes.

711: \end{enumerate}

712: \end{definition}

713:

714: Intuitively, ``take'' means a decision of accepting the offer,

715: while ``pass'' means rejecting it (at the given point). If both

716: agents agree to ``take'' then each one of them will supply half of

717: the units (and the payoff will be split among the agents).
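As a small illustration of the payoff rule (the numbers are hypothetical and chosen only for this example), take $X=4$, so that $K=2$, and suppose the announcements in some iteration are $a_1=3$ and $a_2=5$. The payoffs (agent 1, agent 2) for that iteration are then:
\[
\begin{array}{l|cc}
 & \mbox{2 takes} & \mbox{2 passes} \\ \hline
\mbox{1 takes} & (a_1K,\, a_2K) = (6,10) & (a_1X,\, 0) = (12,0) \\
\mbox{1 passes} & (0,\, a_2X) = (0,20) & (0,0)
\end{array}
\]
In the lower-right cell no agent has taken, so the game continues to the next iteration (if one remains).
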

718:  We

719: now describe the structure of the competitive strategy in a

720: multi-agent one-way trading setting.

721:

722: \begin{theorem}

723: Given a multi-agent one-way trading $T=\langle

724: N=\{1,2\},X,t,m_1,M_1,m_2,M_2  \rangle$, a competitive strategy for agent $i$

725: is as follows:

726:

727: \begin{enumerate}

728: \item For iterations $1 \leq j \leq t-1$, take iff $a_i \geq

729: {{2M_i+m_i} \over 4}$

730: \item If you arrive at iteration $t$ then take.

731: \end{enumerate}

732: \end{theorem}

733:

734: \noindent{Basic idea behind proof:}

735: Consider iteration $j$, $1 \leq j \leq t-1$, and consider the announcement

736: $a_i=Y$. Then, by taking in round $j$, agent $i$ might suffer a regret of

737: $2M_iK-2YK$ (notice that there is a regret when an agent takes only

738: if the other does not take at that iteration).

739: By not taking in stage $j$  agent $i$ might suffer a regret

740: of $2YK-m_iK$ (which is in fact $\max(KY,2YK-m_iK)$).

741:  In order to minimize the regret agent $i$ will therefore have

742: to take whenever $Y$ is at least the value at which $2M_iK-2YK=2YK-m_iK$, i.e. when

743: $a_i \geq {{2M_i+m_i}\over 4}$. The fact that the regret is minimized in iteration

744: $t$ by taking rather than passing is immediate.
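
As a numerical check of the balance point (the values are hypothetical), let $M_i=10$, $m_i=2$ and $K=1$, so that $X=2$. The threshold is
\[
{{2M_i+m_i} \over 4} = {{20+2} \over 4} = 5.5 ,
\]
and at $Y=5.5$ the two worst-case regrets coincide:
\[
2M_iK-2YK = 20-11 = 9, \qquad 2YK-m_iK = 11-2 = 9 .
\]
For a larger announcement, say $Y=7$, taking risks a regret of $20-14=6$ while passing risks $14-2=12$, so taking is the safer choice. For a smaller one, say $Y=4$, the two values are $12$ and $6$, and passing is safer.
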

745:

746:

747:

748: \subsection{Rational competitive analysis for multi-agent one-way

749: trading}

750:

751: We now show the result

752: of applying rational competitive analysis to the context of

753: multi-agent one-way trading:

754:

755:

756: \begin{theorem}

757: Given a multi-agent one-way trading $T=\langle

758: N=\{1,2\},X,t,m_1,M_1,m_2,M_2  \rangle$, a rational

759: competitive strategy for agent $i$

760: is as follows:

761:

762: \begin{enumerate}

763: \item For iterations $1 \leq j \leq t-1$, if the announcement to the other agent, $k$, is

764: $a_k=M_k$, then take.

765:

766: \item For iterations $1 \leq j \leq t-2$, if (1) does not hold then

767: take iff $a_i \geq

768: {{2M_i+m_i} \over 4}$

769:

770: \item If (1) does not hold, then in iteration $t-1$

771: take iff $a_i \geq

772:  {{M_i+m_i} \over 4}$.

773: \item If you arrive at iteration $t$ then take.

774: \end{enumerate}

775: \end{theorem}

776:

777: \noindent{Basic idea behind proof:}

778: Notice that if the announcement to the other agent, $k$, is $a_k=M_k$ then taking

779: dominates any other strategy of agent $k$. In no other case can we say that taking

780: or passing in iterations $1,2,\ldots,t-1$ is dominated. Also, in stage $t$

781: passing is dominated by taking.

782: As a result we will get that agent $i$ will minimize its regret by taking

783: when $a_k=M_k$ or when it arrives at the last iteration.

784: Assume that $a_i=Y$ in iteration $t-1$. Then

785:  the maximal regret we get by taking  is $M_iK-2KY$, and by passing

786: the maximal regret in this case  is $2KY-m_iK$ (which is in fact

787: $\max(KY,2KY-m_iK)$).  This implies that

788: the regret is minimized

789: when $M_iK-2KY=2KY-m_iK$, i.e.

790: when $Y = {{M_i+m_i} \over 4}$.

791: The other case that refers to iterations $1,2,\ldots,t-2$ will be treated as

792: in the case of (standard) competitive analysis.
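
As a numerical check of the iteration $t-1$ threshold (the values are hypothetical), let $M_i=10$, $m_i=2$ and $K=1$. Then
\[
{{M_i+m_i} \over 4} = {{10+2} \over 4} = 3 ,
\]
and at $Y=3$ the two worst-case regrets coincide:
\[
M_iK-2KY = 10-6 = 4, \qquad 2KY-m_iK = 6-2 = 4 .
\]
For these values the threshold used in iterations $1,\ldots,t-2$ is ${{2M_i+m_i} \over 4}=5.5$, so the agent is more willing to take in iteration $t-1$. The difference presumably reflects that passing in iteration $t-1$ leads to iteration $t$, where both agents take, so the best forgone payoff is $M_iK$ rather than $2M_iK$.
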

793:

794:

795: %It is worth to notice that the above analysis made use of the fact we require

796: %from a rationally competitive strategy to be such in any sub-game.

797:

798: \section{Conclusion}

799:

800: Competitive analysis

801: is a major tool in computer science, which has been used in a variety

802: of contexts.

803: In this paper we have introduced rational competitive

804: analysis. Rational competitive analysis generalizes competitive

805: analysis to the context of multi-agent systems.

806:  Moreover, we have shown its use in the context of bidding games

807: and one-way trading, two problems of considerable importance,

808: as well as in the context of general repeated games.

809:  Our approach follows the non-Bayesian normative

810: approach adopted in previous work, but modifies it to incorporate

811: minimal rationality requirements. Such requirements are essential in

812: multi-agent domains.

813:  Many of the previous studies in the context of  competitive analysis

814:  can be naturally extended to multi-agent

815: domains, and then rational competitive analysis can  serve as a fundamental

816: tool for the study of these extensions.

817:   We see the study of such extensions as a most attractive

818: research topic, and hope

819: that others will join us in addressing this challenge.

820:

821: \begin{thebibliography}{10}

822:

823: \bibitem{Axelrod}

824: R.~Axelrod.

825: \newblock {\em The Evolution of Cooperation}.

826: \newblock New York: Basic Books, 1984.

827:

828: \bibitem{BorodinLinialSaks}

829: A.~Borodin, N.~Linial, and M.~Saks.

830: \newblock An optimal online algorithm for metrical task systems.

831: \newblock {\em Journal of the ACM}, 39:745--763, 1992.

832:

833: \bibitem{BorElYa}

834: Allan Borodin and Ran El-Yaniv.

835: \newblock {\em On-Line Computation and Competitive Analysis}.

836: \newblock Cambridge University Press, 1998.

837:

838: \bibitem{BraTenjour}

839: R.~Brafman and M.~Tennenholtz.

840: \newblock {Modeling Agents as Qualitative Decision-Makers}.

841: \newblock {\em Artificial Intelligence}, 94, 1997.

842:

843: \bibitem{BTJACM99}

844: R.~I. Brafman and M.~Tennenholtz.

845: \newblock An axiomatic treatment of three qualitative decision criteria.

846: \newblock {\em Journal of the ACM}, 47(3), March 2000.

847:

848: \bibitem{DT97}

849: J.~Doyle and R.~Thomason.

850: \newblock Qualitative preferences in deliberation and practical reasoning.

851: \newblock Working notes of the AAAI spring symposium, 1997.

852:

853: \bibitem{FiatKarpLuby}

854: A.~Fiat, R.M. Karp, M.~Luby, L.A. McGeoch, D.D. Sleator, and N.E. Young.

855: \newblock Competitive paging algorithms.

856: \newblock {\em Journal of Algorithms}, 12:685--699, 1991.

857:

858: \bibitem{FudLevlearn}

859: D.~Fudenberg and D.~Levine.

860: \newblock {\em The theory of learning in games}.

861: \newblock MIT Press, 1998.

862:

863: \bibitem{FudTir}

864: D.~Fudenberg and J.~Tirole.

865: \newblock {\em Game Theory}.

866: \newblock MIT Press, 1991.

867:

868: \bibitem{KoutsoupiasPapadimitriou}

869: E.~Koutsoupias and C.~Papadimitriou.

870: \newblock On the k-server conjecture.

871: \newblock {\em Journal of the ACM}, 42(5):971--983, 1995.

872:

873: \bibitem{Milnor}

874: J.~Milnor.

875: \newblock {Games Against Nature}.

876: \newblock In R.~M. Thrall, C.H. Coombs, and R.L. Davis, editors, {\em Decision

877:   Processes}. John Wiley \& Sons, 1954.

878:

879: \bibitem{kpricegeb}

880: D.~Monderer and M.~Tennenholtz.

881: \newblock K-price auctions.

882: \newblock {\em Games and Economic Behavior}, 31:220--244, 2000.

883:

884: \bibitem{Wolfstetter}

885: E.~Wolfstetter.

886: \newblock Auctions: An introduction.

887: \newblock {\em Journal of Economic Surveys}, 10(4):367--420, 1996.

888:

889: \end{thebibliography}

890:

891: %\bibliographystyle{aaai}

892: %\bibliographystyle{named}

893:

894: \end{document}

895:

896: \section{Rationalizable Competitive Strategies}

897:

898: The results presented in the previous section show how beneficial

899: rational competitive analysis can be. These are of course most positive

900: news, but as we show now one should still be careful with the interpretation

901: of these results. In the definition of rational competitive analysis we assume that agents will not use any action   may suggest that given even

902: further restrictions on what rational behaviors

903: \begin{enumerate}

904: \item Split the previous two sections into separate   discussions of

905: competitive strategies and rationally competitive strategies.

906: \item Present rationalizable competitive strategies:

907: we would like to minimize regret when the other players may use

908: a strategy that cannot be removed by several steps in which dominated

909: strategies are removed.

910: In a third-price auction the first removal implies that agents will

911: submit at least their valuations; in a second stage this implies

912: that agents 2+ will be indifferent between submitting their valuations

913: and bidding 0; in a third stage this will imply that agent 1 will be indifferent between submitting its valuation and submitting 1; the next stage

914: will tell us however that agents may consider the submission of arbitrary

915: valuations...

916: \end{enumerate}

917:

918: \section{Folk-Theorems}

919:

920: We will look at games like the prisoner's dilemma, and general repeated

921: games, and talk about competitive strategies there.

922: