0502:cond-mat0502112/gt.TEX

1: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

2: %

3: % Game Theory and Statistical Mechanics

4: %

5: %

6: %

7: % .tex author:  	Michael Campbell

8: %  start date:  	07/09/03

9: %  last rev. :	08/19/03; 09/16/03;

10: %                 11/12/03;

11: % 			04/27/04; 05/28/04 modify intro/Cournot/part.proof

12: %			11/25/04; part. proof

13:

14: \documentclass[12pt, reqno, centertags, titlepage, oneside]{amsart}

15: \usepackage{amsmath}

16: \usepackage{amsfonts}

17: \usepackage{amssymb}

18: \usepackage{enumerate}

19: \usepackage{array}

20: \usepackage{graphicx}

21:

22: % pctex

23: %\input setwmf

24:

25: %*** debug

26: %\usepackage{showkeys} %%%% draft cross-referencing

27:

28:

29: %  Margins

30: %% cf. LC p87

31: %  Horizontal

32: %  can set \hoffset to adjust the page horizontally for printer error (print using \the\hoffset)

33: \setlength{\oddsidemargin}{-.5in}

34: \setlength{\evensidemargin}{-.5in}

35: \setlength{\textwidth}{7.5in}

36: \setlength{\marginparsep}{-.25in}

37: \setlength{\marginparwidth}{0in}

38: % Vertical

39: % can set \voffset to adjust the page vertically

40: \setlength{\topmargin}{-.5in}

41: \setlength{\headheight}{0in}

42: \setlength{\headsep}{0in}

43: \setlength{\textheight}{9.75in}

44: \setlength{\footskip}{.5in}			% distance from bottom of text to bottom of footer

45:

46:

47: \setlength{\parindent}{1em}			% paragraph indentation

48: \setlength{\parskip}{1\baselineskip}			% space between paragraphs.

49:

50:

51: \pagestyle{plain}

52: \begin{document}

53:

54:

55:

56: \title{A Gibbsian Approach to Potential Game Theory}

57: \author{Michael Campbell\\ \\

58: 	  version 8.1 Feb 2005}

59: \address{Innovative Research Concepts, Anaheim, CA, USA}

60: \email{mcampbel123@yahoo.com}

61: \begin{abstract}

62: In games for which there exists a potential, the deviation-from-rationality dynamical

63: model for which each agent's strategy adjustment follows the gradient of the potential along with a

64: normally distributed random perturbation, is shown to equilibrate to a Gibbs measure.

65: The standard Cournot model of an oligopoly is shown not to have a phase transition,

66: as it is equivalent to a continuum version of the Curie-Weiss model.

67: However, when there is increased local competition among agents, a phase transition

68: will likely occur.  If the oligopolistic competition has power-law falloff

69: and there is increased local competition among agents, then the model has a rich phase

70: diagram with an antiferromagnetic checkerboard state, striped states and maze-like states

71: with varying widths, and finally a paramagnetic state.

72: Such phases have economic implications as to how agents compete

73: given various restrictions on how goods are distributed.  The standard Cournot model

74: corresponds to a uniform distribution of goods, whereas the power-law variations

75: correspond to goods for which the distribution is more localized.

76: \end{abstract}

77:

78: \maketitle

79:

80:

81:

82: %\begin{center}

83: %\Large

84: %A Gibbsian Approach to Potential Game Theory\\

85: %\ \\

86: %\normalsize

87: %Michael Campbell

88: %\vspace{.5in}

89: %\end{center}

90:

91:

92: \newtheorem{lemma}{Lemma}

93: \newtheorem{theorem}[lemma]{Theorem}

94: \newtheorem{proposition}[lemma]{Proposition}

95: \newtheorem{corollary}[lemma]{Corollary}

96: \newtheorem{axiom}[lemma]{Axiom}

97:

98:

99: \theoremstyle{remark}

100: \newtheorem*{definition}{Definition}

101: \newtheorem*{remark}{Remark}

102: \newtheorem*{remarks}{Remarks}

103: \newtheorem{example}{Example}

104: \newtheorem*{ack}{Acknowledgments}

105:

106: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

107: %  Macros

108: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

109:

110: % GENERAL LINGO ; may need to follow with '/nt' to eliminate extra space

111: \newcommand{\gtApproach}{{Gibbs }}

112: \newcommand{\dfr}{{deviations-from-rationality }}

113:

114:

115: % commands

116: \newcommand{\nt}{\negthickspace}

117: \newcommand{\ntfive}{\nt \nt \nt \nt \nt}

118: \newcommand{\nttwenty}{\ntfive \ntfive \ntfive \ntfive}

119:

120: % in mathmode only

121: \newcommand{\unitx}{\mathbf{\hat{x}}}

122: \newcommand{\unity}{\mathbf{\hat{y}}}

123: \newcommand{\unitz}{\mathbf{\hat{z}}}

124: \newcommand{\con}[1]{\overleftrightarrow{_{#1}} }

125:

126: \newcommand{\lemRef}[1]{Lemma~\ref{#1}}

127: \newcommand{\thmRef}[1]{Theorem~\ref{#1}}

128: \newcommand{\propRef}[1]{Proposition~\ref{#1}}

129: \newcommand{\corRef}[1]{Corollary~\ref{#1}}

130: \newcommand{\exRef}[1]{Example~\ref{#1}}

131: \newcommand{\remRef}[1]{Remark~\ref{#1}}

132: \newcommand{\axRef}[1]{Axiom~\ref{#1}}

133: \newcommand{\eqRef}[1]{equation~\eqref{#1}}

134: \newcommand{\primeEqRef}[1]{equation~\textup{(\ref{#1}$'$)}}

135: \newcommand{\primeeqref}[1]{\textup{(\ref{#1}$'$)}}

136: \newcommand{\secRef}[1]{Section~\ref{#1}}

137: \newcommand{\appRef}[1]{Appendix~\ref{#1}}

138: %

139: %

140: % notations: valid only in math mode

141: %

142: %

143: \newcommand{\np}{{n}}					% number of agent

144: \newcommand{\vol}{\Lambda}				% site set of agents

145:

146: \newcommand{\vty}{\chi}				% volatility/susceptibility

147:

148:

149: % minority game: stategy and number-of-strategies

150: \newcommand{\strat}{\mathfrak{s}}

151: \newcommand{\strats}{S}

152:

153:

154: \newcommand{\vx}{\vec{x}}

155: \newcommand{\vw}{\vec{w}}

156: \newcommand{\vy}{\vec{y}}

157: \newcommand{\grad}{\vec{\nabla}}

158:

159: \newcommand{\vq}[1][{\np}]{(q_1,\dots,q_{#1})}

160:

161: \newcommand{\eval}[1]{|_{#1}}			% evaluation bar

162:

163:

164: \newcommand{\btt}{\tilde{\beta_t}}		% tilde beta in appendix B

165: \newcommand{\bto}{\tilde{\beta_0}}

166:

167: % bracket expectation

168: \newcommand{\br}[1]{\langle {#1} \rangle}			% regular brackets

169: \newcommand{\brf}[1]{\langle {#1} \rangle^{\wedg}}	%Fourier transform of Gibbs measure

170: \newcommand{\Br}[1]{\left\langle {#1} \right\rangle}	% auto sized regular brackets

171: \newcommand{\brn}[2][{}]{\langle {#2} \rangle_{n_{#1}} }	% finite volume reg. brackets

172: \newcommand{\brfn}[2][{}]{\langle {#2} \rangle^{\wedge}_{n_{#1}} }

173: \newcommand{\brh}[1]{\langle {#1} \rangle_h}

174: \newcommand{\Brh}[1]{\left\langle {#1} \right\rangle_h}

175: \newcommand{\brnh}[1]{\langle {#1} \rangle_{n,h}}

176:

177: \newcommand{\qbr}[1]{[\![ {#1} ]\!]} 				%quenched brackets

178: \newcommand{\Qbr}[1]{\left[\!\left[ {#1} \right]\!\right]}

179:

180:

181: \newcommand{\qmin}{{\underline{q}}}

182: \newcommand{\qmax}{{\bar{q}}}

183:

184:

185: % math-mode functions

186: \newcommand{\adj}[1]{ {#1}^{\!\text{*}} }

187:

188: \newcommand{\e}{\operatorname{e}}

189: \newcommand{\E}{\operatorname{E}}

190: \newcommand{\C}{\operatorname{C}}

191: \newcommand{\F}{\operatorname{F}}

192: \newcommand{\Arg}{\operatorname{Arg}}

193:

194:

195: \newcommand{\sinc}{\operatorname{sinc}}

196: \newcommand{\sinhc}{\operatorname{sinhc}}

197:

198:

199: \newcommand{\sgn}{\operatorname{sgn}}

200: \newcommand{\argmax}{\operatorname{argmax}}

201:

202: \newcommand{\Prob}{\operatorname{Prob}}

203:

204: % bond notation

205: \newcommand{\edge}{e}					% edge in bond space

206: \newcommand{\bond}[1][ij]{\edge_{#1}}		% bond in bond space - a specific edge (use: \bond[ab] )

207: \newcommand{\bondConfig}[1][{}]{\omega_{#1}}	% bond configuration - represents a number or function

208:

209: \newcommand{\cfg}{\underline{\sigma}}		% spin configuration

210:

211:

212: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

213: %

214: % section: Introduction

215: %

216: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

217: \section{Introduction} \label{S:Introduction}

218: 	Certain models in game theory \cite{AGH1,AGH2,BS,BSV,HSa1,HSa2,MaZ,MaZ2,Sa1,Sa2} analyze the dynamics of decisions made by agents who adjust their decisions in the direction of higher payoffs, subject to random error and/or information (deductive ``\dfr\nt'' models, cf. \cite{Bl}).  These errors, which are essentially failures to choose the most optimal payoff, are understood to be intrinsic to the agents, and can be due to stochastic elements such as preference shocks, experimentation, or actual mistakes in judgment.  In all of these contexts, the error is assumed to be due to intrinsic properties of the agents; i.e., the error is due to the \emph{agents} making decisions that deviate from the true, optimal decision.  These stochastic approaches commonly use a Langevin equation to deduce logit equilibrium measures for each individual agent.

219:

220: 	The logit measures had been earlier discovered by different reasoning \cite{L, McF}.  Logit measures are exploited in \cite{AGH1, AGH2}, where agents use a continuum of pure strategies with decisions perturbed by white noise, and additionally, agents can use a restricted form of mixed strategies which are absolutely continuous with respect to the Lebesgue measure.  A purely dynamical approach is taken in \cite{MaZ, MaZ2}, where a continuum of pure strategies is assumed and the agents are restricted to using only these pure strategies with their decisions perturbed by white noise.  If agents are restricted to play a single pure strategy at any given point in time as in \cite{MaZ, MaZ2}, mixed strategies in \cite{AGH1, AGH2} can be viewed as a time-average of the agent's pure strategies.  Since averages are effectively used in the stationarity equations of \cite{AGH1, AGH2}, it is not surprising that the solutions for the logit measures are mean-field theoretic in nature insofar as they must find fixed points for these equations to solve them.

221: A mean-field static equilibrium is also found for a discrete Ising-type logit model in \cite{BlD}.

222: The dynamics of a random utility model are also analyzed in \cite{BlD}, and shown to have the same

223: stationary state as with the mean-field analysis.  The static equilibrium is that for a

224: Curie-Weiss model.  Such concepts will be examined below within the context of statistical

225: mechanics.

226: For the dynamics presented in this paper, we will assume that agents are \emph{myopic} in pure strategy space; that is they play a single pure strategy at any point in time as in \cite{MaZ, MaZ2}, and can only make infinitesimal adjustments in pure strategy space over infinitesimal time.  The difference between the dynamical approach in this paper versus those in \cite{AGH1, AGH2, Bl,

227: HSa1, HSa2, MaZ, MaZ2} is that a global approach is used

228: 	\footnote{A single dynamical equation tracks all agents, instead of one equation

229: 		    for each agent.},

230: in full exploitation of potential games \cite{MS}.  As a result, a single stationary measure results, instead of multiple coupled measures in \cite{AGH1,AGH2}.  An explicit form of the measure

231: is also obtained in contrast to the aforementioned references.

232:

233: It is noted that to allow agents to make effectively large moves in pure strategy over infinitesimal time (effectively what happens in \cite{AGH1, AGH2}) is akin to allowing mixed strategies, and as a result, Nash equilibria can be found much more quickly.  As an example, consider pure-strategy quantities $q_1$, $q_2$, and $q_3$ which are positive real numbers.  The dynamics of this paper does not allow a quick jump from $q_1$ to $q_2$ since $|q_1-q_2|>0$, where $|\cdot|$ is the distance function on the line.  However, with mixed strategies, an agent can shift from $q_1$ to $(1-\epsilon)q_1 + \epsilon q_2$ for small epsilon, and get a quick sense of the strategy $q_2$ from $q_1$ and shift towards $q_2$ in one step if it is better.  This can happen since $\|q_1 - [(1-\epsilon)q_1 + \epsilon q_2]\|<2\epsilon$, where $\|\cdot\|$ is the distance function on the Banach space of Radon measures.  This is similar to classical, noiseless potential game theory when the Nash equilibrium is quickly and efficiently found to be a pure state (i.e., the global interior maximum of the potential function) since agents can make large unilateral changes in decision.  This does not happen in the myopic, noisy dynamics presented here at nonzero temperature, as is evident from the resulting mixed strategy Gibbs/logit stationary state.  The classical game-theory equilibrium (i.e., Nash equilibrium)

234: is attained from the Gibbs measure at zero temperature.  A thorough analysis of types of equilibria

235: that occur in potential games can be found in \cite{Sa1}.

236:

237: 	In addition, an alternative view is proposed in this paper, in the context of potential games.  The same Gibbs/logit equilibrium distribution that results from myopic, noisy dynamics can be derived using certain axioms.  This avoids explicit reference to error in agents' decisions and extends easily to games with discrete pure strategy spaces.  These axioms are in fact those for equilibrium thermodynamics and are used to do large deviation theory.  Therefore this alternative approach is empirical in nature: our model of agent behavior using thermodynamic axioms is based on many observations rather than the intrinsics (i.e. dynamics) of the game.  It will be shown that this \emph{\gtApproach\nt} approach to games with a continuum of strategies is equivalent to the aforementioned global, noisy, myopic \dfr potential game.  As in classical statistical mechanics, we will assume the observables (i.e., functions of strategies) to be $C(\Omega)$, the continuous functions on pure strategy space.  Hence the general mixed strategies will be the probability states in the dual of $C(\Omega)$, $R_1(\Omega)= \{\mu \in C(\Omega)^* |\, \mu \geq 0, \mu(1)=1\}$.  That is to say the mixed strategies will be generalized to probability Radon measures on $\Omega$, which can include density functions with Lebesgue measure (such as the Gibbs measure), and delta functions for pure states (zero-temperature/pure-rational limits of Gibbs measures).

238:

239: Since we only use one dynamical equation, only one stationary logit measure is obtained and we will call this measure the \emph{Gibbs measure} of the system, exactly as in statistical mechanics.  This will also avoid confusion with the more general case presented in \cite{AGH1, AGH2} of multiple logits for each agent.  The Gibbs measure here is one for the entire system of agents in the given game which results from the global dynamical equation.  This is a very interesting feature of potential games with noisy, myopic dynamics.  It is stated in \cite{MaZ2} that ``...in game theory, there is not a unique Hamiltonian, but rather each agent has her own Hamiltonian to minimize.''  This is true in general, but we'll show that we do have a unique Hamiltonian in the case of potential games.  The idea of looking for something global to unify dynamics was touched upon in \cite{BCP} with the use of a ``global cost function'' in the `minority game', but the technique is through the use of another dynamical method called ``inductive reasoning'' \cite{A}, rather than errors to deductive reasoning.  Within inductive reasoning, an explicit formula for the stationary measure in terms of the state of the agents does not appear.  The equilibrium measures obtained here do have such an explicit formula: a Gibbs measure in terms of the potential.  The potential was pointed out in \cite{AGH1} to be the common part of the payoff functions that includes the benefits or costs that are determined by each agent's own decisions (e.g., the effort costs).  Mathematically, the potential is just like the negative energy of all of the particles (i.e. agents) in a physical system (i.e. game) for a given state (i.e. point describing each agent's decision).

240:

241: 	The essence of the thermodynamic approach is that we are measuring many cases of the same game, as one would repeat an experiment many times to get average results; i.e., an \emph{ensemble average}.  The errors are not explicitly encoded at the (microscopic) level of individual agent decisions in this approach.  However, this endogenous noise is implicitly assumed on a macroscopic level by requiring the ensemble average of the potential be constant.

242: 	\footnote{If one game in the ensemble gains potential, another must

243: 	lose potential which is counter to purely rational behavior.}

244: The average values are due to making many measurements and averaging the results over the \emph{ensemble} of the multitude of games being observed.  Hence it's assumed that we don't know the details of probabilities of being in various states, but rather that we must deduce these probabilities from axioms of empirical observation.  The size of the ensemble (i.e., the number of representative games we observe) must be very large so that statistical laws of large numbers hold, and that the empirical probabilities reflect the true intrinsic probability distribution of a single game.  However, unlike the ideas proposed in \cite{Ma, MaZ, MaZ2}, we do not need to assume that there are a large number of agents and we do not treat a single agent as though she is a system in a `heat bath' of agents, which implies an ensemble of agents and that we are only looking at one game.  Rather, we treat the entire game as a system, and look at an ensemble of many identical games.  This approach also allows an easy generalization to discrete/finite decisions.  The partition function is introduced systematically and axiomatically, rather than as a post-hoc mechanism to solve a minimum free energy problem in inductive models as in \cite{CGGS, MaCZ}.  The logit measure follows very naturally from the framework presented here.  The limitation is that, for now, a systematic introduction of a partition function (i.e., explicit stationary measure) is only justified for potential games, whereas \cite{AGH1, AGH2, MaCZ} is more general since agents have individual measures.

245:

246: 	An interesting aspect of this \gtApproach approach is that the coefficient that determines the importance of errors in a stochastic potential game (the $\mu_i=2/\sigma^2$ in \cite {AGH1, AGH2} or the $2/\nu^2$ in \eqRef{E:FokkerPlanck}) is directly proportional to the usual notion of temperature in statistical mechanics.  This was also discussed in \cite{Ma} for individual agents.  In fact, in the kinetic theory of gases in equilibrium, the temperature $T$ is proportional to the average of the square of velocities $v$ of the gas particles ($T = k\cdot\br{v^2}$ for a constant $k$).   If a game has a potential that is the mathematical equivalent of such a gas, this equilibrium `kinetic' theory states that the `temperature' of that game is proportional to the average of the square of the temporal rates-of-change of decisions ($T = c\cdot\br{[dx/dt]^2}$, where $x\in[\underline{x},\bar{x}]$ represents an agent's decision).  Hence larger instantaneous changes in decisions by agents correspond to higher temperatures of the game, and in general, temperature is a measure of the rate-of-change in decisions.  In the dynamical sense \cite{AGH1, AGH2}, temperature measures agents' deviation from rationality.

247:

248: 	A major application of statistical mechanics is to take infinite-agent limits from finite agent games, and an example of this will be done to show the classical result of oligopolies approaching perfect competition as the number of agents goes to infinity.  This requires some modest technical restrictions to be made on the potential function so that the finite-agent Gibbs measures will converge in the infinite-agent limit (c.f. \cite{Si}); however these restrictions also make clear economic sense.  A major advantage with the \gtApproach approach for potential games is that, for various potentials, all of the rigorous techniques for phase transitions can be implemented as well as the process of renormalization of infinite-agent games.  Renormalization would allow us to look at behavior of aggregate blocks of agents at various scales. Dynamical renormalization is also very recently being done in economics \cite{FJL, FL}, where large numbers of small firms are ``agglomerated'' and the dynamics of the resulting ``meta-firms'' are examined via a sandpile model. It will only be mentioned in this paper that renormalization can be done.  The standard real-space renormalization is applicable to any potential game with the imposition of a lattice dimension, since all agents interact in many classic economic models.  It would also be interesting to compare dynamical renormalization of the noisy myopic dynamics with the real-space renormalization of the Gibbs measure.  Hopefully, the model shown here may be fruitful in understanding how local and global interactions between agents unify; that is to say a better understanding and more systematic treatment of economic interactions at small and large scales.  Renormalization may link different economic scales as it does different scales in physics.

249:

250:

251: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

252: %

253: %  SECTION: Potential games and axioms

254: %

255: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

256: \section{Potential Games and Axioms} \label{S:pGamesAxioms}

257:

258: 	Let us consider a game with a finite number of $\np$ agents, and all of these agents belong to the set $\vol$.  At any moment in time, agent $i\in\vol$ can select an action or strategy $x_i \in A_i$ and the $x_i$ are the \emph{decision variables}.  A \emph{configuration} $\vx$ of the system is any possible state of the system:  $\vx = (x_1, x_2, \dots, x_\np)$, where each $x_i \in A_i$.  The set of all possible configurations of the game is $\Omega \equiv \prod_i A_i$.  A typical example of $A_i$ is an interval of real numbers $A_i=[\underline{x},\bar{x}]$.  The development below is for continuum decisions, and discrete decisions will be accommodated later.

259:

260: In this article, we will only consider potential games \cite{MS}.  Let

261: 	\begin{equation}

262:  	V(\vx)=V(x_1,x_2,\dots,x_\np):\Omega\rightarrow(-\infty,\infty)

263: 	\label{E:potential}

264: 	\end{equation}

265: be the potential function for the game.  Recall that a potential game is one such that every agent's payoff function $u_i(\vx)$ satisfies

266: 	\begin{equation}

267: 	\frac{\partial u_i}{\partial x_i} = \frac{\partial V}{\partial x_i}

268: 	\label{E:potentialCond}

269: 	\end{equation}

270: for some function $V(\vx)$.

271:

272: Certain conditions must be imposed upon $V$ in order that probability distributions exist when the number of agents is taken to infinity in a limit.  These conditions are well known functional-analytic restrictions to ensure that infinite-volume limits exist (c.f. \cite{Si}).  Basic postulates for a game in equilibrium will be introduced (c.f. \cite{R}).  These are just the usual thermodynamics, and the probability of finding the game in a particular state can be derived from these postulates of empirical observation.

273:

274: \begin{axiom}[conservation of energy] \label{A:axiom1}

275: The game is \emph{isolated} and agents act in such a way that the ensemble average of the potential is a constant.\end{axiom}

276:

277: The `isolated' requirement excludes outside sources from changing the potential for a given set of choices.  Even though agents in a specific game in the ensemble try to maximize potential, the intrinsic noise will introduce errors into agents' decisions such that an increase in potential in one game will correspond to a decrease in the potential in one or more other games in the ensemble.  Since purely rational agents attempt to maximize potential, this axiom precludes perfect rationality unless the ensemble average of the potential \emph{is} the maximum potential (this is the zero-temperature case).

278:

279: \begin{axiom}[equilibrium/stationarity] \label{A:axiom2}

280: The isolated game is in \emph{equilibrium}.  This is to say that the probability of finding the game in any one state (i.e., a single representative from the ensemble) is independent of time.

281: \end{axiom}

282:

283: This is the usual commonsense notion of equilibrium.  From \axRef{A:axiom2}, all \emph{macroscopic} parameters (those that describe the system as a whole) are then also time-independent.  Macroscopic parameters will be described in more detail later, but examples are the average output per agent, the average payoff, and the temperature.

284:

285: \begin{axiom}[ergodicity] \label{A:axiom3}

286: The isolated game in equilibrium is equally likely to be in any of its accessible states.

287: \end{axiom}

288:

289: An \emph{accessible state} $\vx$ is any configuration of decisions $\vx\in\Omega$ with $V(\vx)=V_0$, given a fixed observed potential $V_0$ for the game (i.e., we look at all games in the ensemble with potentials of $V_0$).  If a potential game with potential $V$ has payoff functions $u_i(\vx)$ for each agent $i\in\vol$, then

290: 	\begin{equation}

291: 	\frac{\partial}{\partial x_i} u_i(\vx) =

292: 		\frac{\partial}{\partial x_i} V(\vx).

293: 	\label{E:gradV}

294: 	\end{equation}

295: If the potential for $\np$ agents is restricted to a constant value $V_0$ (i.e., we look at a constant-potential surface in $\Omega$), then the equilibrium dynamical system of \cite{AGH1} is ergodic.  Fluctuations will cause a system to move among its accessible states over time with equal frequency, and ergodicity means essentially that the system, over time, will visit each point on the constant potential surface.  As such, it is equivalent to assume all points on the constant $V_0$ surface have the same probability of being observed.  An interesting consequence of this that the condition $\partial V/\partial t =0$ does not necessarily imply an equilibrium (c.f. \cite{AGH1}).  If the system were at a local maximum of $V$ (at say $\vx_0$) that isn't a global maximum, then \axRef{A:axiom3} implies the system can't stay at the single point $\vx_0$.  Rather, over time, it will move on the whole surface $V=V(\vx_0)$.  Since $V(\vx_0)$ isn't a global maximum, the system will reach another point with the same potential but with nonzero gradient, and the dynamics can move the system towards a higher potential than $V(\vx_0)$.

296:

297: 	The approach to stochastic game theory in \cite{AGH1} produces a single Fokker-Planck equation that the joint probability distribution of decisions on $\Omega$ satisfies.  There, the Langevin equations for the joint distribution are

298: 	\begin{equation}

299: 	dx_i(t) = \frac{\partial}{\partial x_i} u_i(\vx,t)dt + \nu \, dw_i(t),

300: 		\: 1\leq i\leq\np,

301: 	\label{E:Langevin}

302: 	\end{equation}

303: where the $w_i$ are zero-mean, unit-variance normal random variables and $\nu$ is a variance parameter.

304:

305: The following result very similar to \cite{AGH1}.  The major difference here is that we are looking at a dynamics for all agents who only use pure strategies, whereas in \cite{AGH1} single-agent dynamics are analyzed separately, and they use mixed strategies.  The point of the following result is to show the stochastic dynamical approach in global form leads to the same Gibbs measure as that derived from \axRef{A:axiom1}, \axRef{A:axiom2}, and \axRef{A:axiom3}.  The proof is in \appRef{S:appendixA}.  Again, it is important to note that the dynamics below assume agents only use pure strategies and can only make small-distance moves over pure strategy space within a small time period (myopic agents).

306:

307: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

308: %  Prop - potential game?

309: %

310: \begin{proposition} \label{P:potentialGame}

311: Let $f(\vx)$ be the joint density over decision space $\Omega$ for a potential game with a finite number of agents $\np$ and potential $V$.  Consider the dynamics

312: 	\begin{equation}

313: 	d\vx = \grad{V}dt + \nu d\vw(t),

314: 	\label{E:langevin}

315: 	\end{equation}

316: where $x\in\Omega$, $d\vx=(dx_1,\dots,dx_\np)$, $\grad{V}=(\partial V/\partial x_1, \dots, \partial V/\partial x_\np)$, and $\vw(t)=(w_1(t),\dots,w_\np(t))$ with the $w_i(t)$ standard Wiener (or white noise) processes which are identical and independent across agents and time.  Furthermore, the $w_i(t)$ have mean zero and variance one and reflecting boundary conditions

317: 	\footnote{This requires zero time derivatives on the boundary,

318: 	specifically that \eqref{E:statState} be satisfied for boundary points $\vx$.}

319: are used.  Note that no conditional averages are done on $V$ above, as opposed to \cite{AGH1, AGH2}.  This indicates that only pure strategies are being played.

320:

321: If the process $\vx(t)$ satisfies the dynamics of \eqref{E:Langevin}, then the joint density satisfies the Fokker-Planck equation

322: 	\begin{equation}

323: 	\frac{\partial f(x,t)}{\partial t} =

324: 	  -\grad\cdot[\grad{V}(\vx(t)) f(\vx,t)] + \frac{\nu^2}{2} \nabla^2 f(\vx,t)

325: 	\label{E:FokkerPlanck}

326: 	\end{equation}

327: and the corresponding equilibrium measure for variance $\nu^2$ is the Gibbs state

328: 	\begin{equation}

329: 	f(\vx,t) = f(\vx) =\frac{exp\left( \frac{2}{\nu^2} V(\vx) \right)}

330: 				{\int_\Omega exp\left( \frac{2}{\nu^2} V(\vy) \right) d\vy}.

331: 	\label{E:Gibbs}

332: 	\end{equation}

333: \end{proposition}

334:

335: In statistical mechanics, the term in the exponent of \eqref{E:Gibbs} is $-E(\vx)/(kT)$, where $k$ is Boltzmann's constant, $T$ is temperature, and $E(\vx)$ is the energy of configuration $\vx$.  Hence the analogy of a potential game to statistical mechanics is that $\nu^2$ (deviation from rationality; influence of the noise in dynamics \eqref{E:Langevin}) is proportional to `temperature' and the potential $V$ is the negative `energy' of the system.

336:

337:

338: Now the Gibbs measure above can be derived using only the axioms of thermodynamics \axRef{A:axiom1}, \axRef{A:axiom2}, and \axRef{A:axiom3}.  The proof is the standard proof of the `canonical distribution' in any text on thermodynamics, and is the one that finds a state (i.e. probability measure) that maximizes entropy under the condition of a constant ensemble-average energy.  The variational form of this maximization problem is the Gibbs variational principle of finding a state that maximizes the Helmholtz free energy.  It is interesting to note that the problem of finding a state that maximizes the Liapunov function (for dynamical equilibrium) in \cite{AGH1} is identical to finding a state that minimizes the Helmholtz free energy of a given potential game.  This is exactly what the Gibbs variational principle accomplishes.  It is no surprise that the Liapunov function in \cite{AGH1} \emph{is} the negative Helmholtz free energy for corresponding potential game with no explicit noise.  This fact is stated here as a contrast to \propRef{P:potentialGame}:

339:

340: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

341: %  Prop - Potential game like stat. mech./ basic axioms stat mech.

342: %

343: \begin{proposition} \label{P:potentialGameAxiom}

344: Let $f(\vx)$ be the joint density over decision space $\Omega$ for a potential game with a finite number of agents $\np$ and a potential $V$.  Suppose  the three thermodynamic axioms \axRef{A:axiom1}, \axRef{A:axiom2}, and \axRef{A:axiom3} hold.  Then the equilibrium measure for this potential game is the Gibbs state at inverse-temperature $\beta\equiv(kT)^{-1}$:

345: 	\begin{equation}

346: 	f(\vx,t) = f(\vx) =\frac{exp\left( \beta V(\vx) \right)}

347: 				{\int_\Omega exp\left( \beta V(\vy) \right) d\vy}.

348: 	\label{E:Gibbs2}

349: 	\end{equation}

350: \end{proposition}

351:

352: Here the constant $\beta$ in $f(\vx)$ is determined so that the mean potential $\int_\Omega V(\vx) f(\vx) d\vx=\bar{V}$, where $\bar{V}$ is the ensemble-average observed potential (i.e., the sum of the potentials of each game in the ensemble divided by the number of games $g$ in the ensemble $(V_1+\cdots+V_g)/g$).  This is equivalent to the notion of considering one game to reside within the `heat-bath' of the other games in the ensemble with a fixed total ensemble potential $V_1+\cdots+V_g$.

353:

354: We consider equilibrium states of an infinite-agent game to be the measure(s) resulting from the appropriate infinite-agent/volume limit of the Gibbs state.

355:

356: In the case of potentials that fit the usual notion of an `interaction'

357: 	\footnote{It is noted that the potential for a Cournot oligopoly

358: 	is \emph{not} such an `interaction', and other techniques must be

359: 	used.  This is because of the $1/\np$ factor; see \secRef{S:Cournot}

360: 	for the explicit potential.}

361: in statistical mechanics (c.f. \cite{Si} for the appropriate machinery), the infinite-agent measures will be consistent with the finite-agent measures via the Dobrushin-Lanford-Ruelle equations.  Such techniques can be used directly to analyze possible phase transitions for an infinite-agent potential games.  We can then consider equilibrium states of an infinite-agent game to be the appropriate infinite-volume limit of the Gibbs measure.

362:

363:

364:

365:

366:

367: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

368: %

369: %

370: %

371: %  SECTION: Examples

372: %

373: %

374: %

375: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

376:

377:

378: \section{Example: Cournot Oligopoly, Local Demand, Perfect Competition and Collusion} \label{S:Cournot}

379: The simplest oligopoly model is due to Augustin Cournot (1838).  There is one homogeneous good with demand function $p(Q)$, where $Q$ is the total quantity of the good produced.  Given an oligopoly of $\np$ firms, if each firm produces an amount $q_i$ of the good, then

380: 	\begin{equation}

381: 	Q = \sum_1^{\np} q_i.

382: 	\label{E:Q}

383: 	\end{equation}

384: For the sake of later analysis, each firm's production $q_i$ will be scaled between a minimum production $\qmin\geq0$ and a maximum production $\qmax>\qmin$.  We will assume each firm produces a sufficiently large number of the good so that the $q_i$ can be regarded as continuum variables $\qmin\leq q_i\leq\qmax$, $1\leq i\leq \np$.  For example, $q_i=1$ can represent the production of a sufficiently large number of goods.  It is noted that smaller production would be handled in the discrete case as in \secRef{S:MG}.

385:

386: Each firm uses quantity $q_i$ as its strategy based on the payoff function

387: 	\begin{equation}

388: 	\Pi_i = q_i p(Q) - C_i(q_i),

389: 	\label{E:generalPayoff}

390: 	\end{equation}

391: where $C_i$ is the $i$th firm's cost function.  We will assume a linear (inverse) demand function

392: 	\begin{equation}

393: 	p(Q) = a - \frac{b}{\np} Q,

394: 	\label{E:linearDemand}

395: 	\end{equation}

396: with constants $a>0$ and $b>0$.  Notice that $b$ is divided by $n$ so that demand is based on the average production.  Thus demand stays nonnegative for large $n$.  For example, if each firm were to produce $\qmax$, $p(n\qmax) = a - b\qmax$ is well-behaved and non-trivial (i.e., doesn't go to negative infinity or zero).

397:

398: Constant marginal costs (i.e., $C_i'=c$, for a constant $c>0$) will also be assumed, so that $C_i(q_i) = c q_i + d_i$, and the payoff functions are now

399: 	\begin{equation}

400: 	\Pi_i = q_i \frac{-b}{\np}\sum_1^n q_j + (a-c)q_i - d_i.

401: 	\label{E:payoff}

402: 	\end{equation}

403:

404: In the case of collusion, firms would try to maximize total industry profits

405: 	\begin{equation}

406: 	\Pi = \sum_1^{\np} \Pi_i = -\frac{b}{\np}\sum_{i,j=1}^{\np} q_i q_j

407: 				+ (a-c)\sum_1^{\np} q_i - \sum_1^{\np} d_i.

408: 	\label{E:payoffC}

409: 	\end{equation}

410:

411: Both of these cases constitute potential games.  A potential for the oligopoly in \eqRef{E:payoff} is

412: 	\begin{equation}

413: 	V_o = -\frac{b}{2\np} \sum_{i,j} q_i q_j - \frac{b}{2\np}\sum q_i^2

414: 		+ (a-c)\sum q_i

415: 	\label{E:potentialO}

416: 	\end{equation}

417: and a potential for collusion is the same as the collusion payoff function (except for the irrelevant constants $d_i$) in \eqRef{E:payoffC},

418: 	\begin{equation}

419: 	V_c = -\frac{b}{\np} \sum_{i,j} q_i q_j + (a-c)\sum q_i

420: 	\label{E:potentialC}

421: 	\end{equation}

422: These two potentials are two cases of the potential

423: 	\begin{equation}

424: 	V = -\frac{b}{\np} \sum_{i,j} q_i q_j - \frac{\tilde{b}}{\np}\sum q_i^2

425: 		+ (a-c)\sum q_i.

426: 	\label{E:potentialCO}

427: 	\end{equation}

428: To get the potential in \eqRef{E:potentialO}, replace $b$ and $\tilde{b}$ in \eqRef{E:potential} with $b/2$.  To get the collusion potential in \eqRef{E:potentialC}, set $\tilde{b}$ to

429: zero.

430: 	\footnote{

431: 	It is interesting to note the effect of collusion pertaining to rationality.

432: 	If $a-c=0$ in \eqref{E:potentialCO}, then collusion has the effect of doubling

433: 	$\beta$ in the partition function \eqref{E:partitionCO}.  The collusive model

434: 	is simply the non-collusive oligopolistic model at half the temperature

435: 	$1/(2\beta)=T/2$.  In the infinite-agent limit, the oligopolistic model is

436: 	the same as perfect competition.

437: 	Hence collusion is more ``rational'' behavior than perfect competition!}

438:

439: The Nash equilibrium for the Cournot oligopoly with potential $V$ above is

440: 	\begin{equation}

441: 	q_j^* = \frac{a-c}{2( b + \tilde{b}/\np ) }.

442: 	\label{E:qEquil}

443: 	\end{equation}

444: For non-collusion, $q_j^* = [\np/(\np+1)] [(a-c)/b]$, and for collusion $q_j^* = (a-c)/(2b)$ are the Nash equilibria.

445: 	\footnote{These equilibria look like those for \emph{total} industry output

446: 		    $Q=\sum q_i$ in the standard literature, but as described below

447: 		    \eqref{E:linearDemand}, the scaling is at the level of firm production

448: 		    instead of industry production (where firm production would go to zero

449: 		    as $\np$ increases).}

450: Note for collusion, the output is half of that for the non-collusive model, and hence firm profit \eqref{E:payoff} is less than it would be for the non-collusive equilibrium.

451: These classical equilibrium solutions will be recovered in the perfectly rational case (zero temperature, $\beta=0$) below, using the \gtApproach approach.

452:

453: The partition function for $\np$ agents is

454: 	\begin{equation}

455: 	\mathcal{Z}_{\np} = \int \prod_{i=1}^{\np} dq_i

456: 	  \exp\left[ -\beta\frac{b}{\np} \sum q_i q_j

457: 		-\beta\frac{\tilde{b}}{\np}\sum q_i^2 + \beta(a-c)\sum q_i \right].

458: 	\label{E:partitionCO}

459: 	\end{equation}

460:

461: The infinite-agent free energy

462: 	\begin{equation}

463: 	F(\beta,a,b,\tilde{b},c,\qmax,\qmin) =

464: 		\lim_{\np\to\infty} \frac{1}{\beta \np} \ln( \mathcal{Z}_{\np,h} ).

465: 	\label{E:FreeEnergyDef}

466: 	\end{equation}

467: is calculated in \appRef{S:appendixB} to be

468: 	\begin{equation}

469: 	\begin{aligned}

470: 	F(\beta,a,b,c,\qmin,\qmax)

471: 	=&\frac{ b(\qmax+\qmin)[ b(\qmax+\qmin)- 2(a-c) ]}{4\beta}

472: 	  -\frac{\ln(\qmax-\qmin)}{\beta}

473: 	\\

474: 	&- \frac{1}{\beta}\min_{y\in(-\infty,\infty)}

475: 		-\beta b y^2

476: 		+ \ln{\Big (} %\left(

477: 			\sinhc\left[

478: 		        \beta b(\qmax-\qmin)y +

479: 			  \beta\frac{(\qmax-\qmin)}{2} \{b(\qmax+\qmin)-(a-c)\}

480: 			\right]

481: 		{\Big )}%\right)

482: 	\end{aligned}

483: 	\label{E:freeEnergy}

484: 	\end{equation}

485: where $\sinhc(x)=\sinh(x)/x$, $\sinhc(0)=1$, and it is noted that the free energy $F$

486: is independent of $\tilde{b}$, which can be set to zero.  It is also shown in

487: \appRef{S:appendixB} that the expected value of any agent's output is

488: 	\begin{equation}

489: 	\br{q_i} = \frac{\partial}{\partial a} F

490: 	= \frac{a-c}{2b} + \frac{y_m(\beta,a,b,c,\qmin,\qmax)}{\beta Qb},

491: 	\end{equation}

492: where $|y_m|<|(\qmax+\qmin)/2 - (a-c)/(2b)|$, $y_m<0$ when $(a-c)/(2b)>(\qmax+\qmin)/2$,

493: and $y_m>0$ when $(a-c)/(2b)<(\qmax+\qmin)/2$.  Hence the Gibbs equilibrium

494: is always between the average $(\qmax+\qmin)/2$ and the Nash Equilibrium $(a-c)/(2b)$.

495: In the completely irrational limit

496: 	\begin{equation}

497: 	\lim_{\beta\to 0} \br{q_i} = \frac{\qmax+\qmin}{2},

498: 	\label{E:magIrr}

499: 	\end{equation}

500: and the output is pushed to the average as would be expected for

501: uniformly random behavior.

502: In the completely rational limit,

503: 	\begin{equation}

504: 	\lim_{\beta\to\infty} \br{q_i}

505: 	= \left\{

506: 		\begin{aligned}

507: 		&\qmin \qquad \text{if\ }&\frac{a-c}{2b}<\qmin,

508: 		\\

509: 		&\frac{a-c}{2b} &\qmin\leq \frac{a-c}{2b}\leq\qmax,

510: 		\\

511: 		&\qmax &\frac{a-c}{2b}>\qmax,

512: 		\end{aligned}

513: 	\right.

514: 	\label{E:mag}

515: 	\end{equation}

516: which is the classical Nash equilibrium for an infinite number of agents in \eqref{E:qEquil}.

517:

518: From the properties of $y_m$, we see that agents adjust output down (up) from

519: $q^*\equiv(a-c)/(2b)$ towards the average output $q_{av}\equiv(\qmax+\qmin)/2$

520: if $q^*$ itself is larger (smaller) than $q_{av}$.

521: The magnitude of the adjustment increases as the deviation from rationality

522: (i.e., temperature or $1/\beta$) increases.

523: This shows that a collusive-type behavior occurs insofar as agents produce \emph{less} than

524: the purely rational equilibrium output $q^*$ when $q^*>q_{av}$.  When $q^*<q_{av}$,

525: agents will produce \emph{more} output than $q^*$, which is anti-collusive/supercompetitive.

526:

527:

528: It is also shown in \appRef{S:appendixB} that the \emph{volatility}, which is

529: $\vty \equiv\beta^{-1} \partial^2 F/\partial h^2$, is zero at zero temperature

530: (as expected for perfect rationality) and goes to $(\qmax-\qmin)^2/12$ as temperature goes

531: to infinity (i.e., randomly acting agents have independent, uniformly distributed

532: outputs $q_i$).

533: An agent's payoff can be computed using $\br{q}$ and $\vty$.  Since

534: $\br{\Pi_k} = \lim_{\np\to\infty} q_k[ -b/n \sum_{j=1}^\np q_j + (a-c)]$ and

535: $\br{\Pi_k} = \br{\Pi_i}$, $1\leq i\leq\np$, we have

536: 	\begin{equation}

537: 	\br{\Pi_k} = \lim_{\np\to\infty}

538: 		-\frac{b}{\np}\vty  + \br{q_k}(h-b\br{q_k})

539: 	= \br{q_k}(h-b\br{q_k}).

540: 	\end{equation}

541: The maximum payoff occurs at the Nash equilibrium $\br{q_k}=q^*$.  Furthermore, from

542: the explicit formula for $\br{q_k}$ it is evident that $\br{\Pi_k}$ is an increasing

543: function of $\beta$.  Payoff increases as the deviation from rationality (i.e. temperature)

544: decreases.

545:

546: The model presented above is the standard Cournot model, which assumes

547: the inverse demand is symmetric among the agents.  In reality, this is not typically the case.

548: For example, water prices in parts of a city with higher elevation may have a higher price

549: because of additional costs of pumping the water.  Likewise, competition among agents may

550: not be perfectly uniform.  An example of this would be firms who are more competitive against

551: neighboring firms than with those far away, such as a local restaurant.  It draws customers

552: from mostly one area, and would be in greater competition with other nearby restaurants.  The

553: success or failure of distant restaurants would not have any affect.  A single member

554: of a restaurant franchise competes with local restaurants, but colludes with other more distant

555: members of the parent company.

556: In contrast to this would be a type of regional collusion, where all of the

557: firms in any given locale may work in collusion with each other in an attempt to outcompete

558: distant firms.

559:

560: To formalize these ideas, notice that an individual payoff function can be written as

561: the collusion potential minus all other agents' payoff functions:

562: 	\begin{equation}

563: 	\Pi_k = V_c - \sum_{i\not= k} \Pi_i.

564: 	\label{E:payDiff}

565: 	\end{equation}

566: This shows an agent in oligopolistic competition will offset the collusive potential by

567: subtracting the gains made by \emph{all} other agents.  A smaller degree of competition

568: would correspond to subtracting fewer of the other agents payoffs.

569:

570: \begin{example}[Local Collusion Within a Global Oligopoly]\label{Ex:lc}

571: Suppose $\np$ agents are assumed to lie at integer points on the real line.  If agents

572: work collusively in disjoint groups of three, the payoffs are of the form

573: 	\begin{equation}

574: 	\Pi^{lc}_{k\pm 1}=\Pi^{lc}_k

575: 	= \sum_{i=k,k\pm 1} \Pi_i

576:  	= \sum_{i=k,k\pm 1} \sum_{j=1}^{\np}

577: 		 \left[ -\frac{b}{\np} q_i q_j + (a-c) q_i \right]

578: 	\qquad k=3m, \text{\ for integer\ }m.

579: 	\label{E:localCollusion}

580: 	\end{equation}

581: A potential is $V_{lc}=V_o - (b/\np) \sum_{k=3m,i=k\pm 1} q_i q_k$, where $V_o$ is the

582: oligopolistic potential in \eqref{E:potentialO}.  Although the potential is not translation

583: invariant, if `block outputs' $Q_k\equiv(q_{k-1}+q_k+q_{k+1})/3$, $k=3m$, are used

584: along with periodic boundary conditions ($q_\np\equiv q_1$) then

585: a renormalized potential would be translation invariant.

586: \end{example}

587:

588: \begin{example}[Local Oligopoly Within Global Collusion] \label{Ex:lo}

589: Suppose now that agents collude with non-neighboring agents and compete oligopolistically

590: with neighboring agents within the same setting as \exRef{Ex:lc}.  Here, however, the

591: agents are competing oligopolistically with nearest neighbors, and not in disjoint groups

592: of three.  The payoffs are

593: 	\begin{equation}

594: 	\Pi^{lo}_k

595: 	= \sum_{i\not= k\pm 1} \Pi_i= V_c - \Pi_{k-1} - \Pi_{k+1},

596: 	\label{E:localCPay}

597: 	\end{equation}

598: where we define $\Pi_0\equiv 0$ and $\Pi_{\np+1}\equiv 0$ to get correct payoffs.

599: The potential is

600: 	\begin{align}

601: 	V_{lo} &= V_c + \frac{b}{n}\sum_{i=1}^{\np-1} q_i q_{i+1}\\

602: 	&=

603: 	-\frac{b}{\np} \sum q_i q_j + (a-c)\sum q_i

604: 		  + \frac{b}{\np} \sum_{i=1}^{\np-1} q_i q_{i+1}.

605: 	\label{E:localOPot}

606: 	\end{align}

607: It is shown in \appRef{S:appendixB} that the $\tilde{b}$ term in the potential $V$

608: of \eqref{E:potentialCO} has no significance in the infinite-agent limit because

609: the number of terms in the summand is of order $n$, and as such it can be set to zero.

610: This is the case for the last nearest-neighbor sum in \eqref{E:localCollusion}, and in the

611: infinite-agent limit, the free energy is the same as the collusion model.

612: To maintain local oligopolistic competition in the infinite-agent limit, the interaction

613: term $b/\np$ must be changed to a constant $\delta>0$ that does not vanish as $\np\to\infty$.

614: \end{example}

615:

616: \begin{example}[True Local Oligopoly Within Global Collusion] \label{Ex:loReal}

617: As mentioned in \exRef{Ex:lo}, an appropriate potential that maintains local competition

618: in the infinite-agent limit is

619: 	\begin{equation}

620: 	V_{lo} =

621: 	-\frac{b}{\np} \sum q_i q_j + (a-c)\sum q_i

622: 		  + \delta \sum_{i=1}^{\np-1} q_i q_{i+1},

623: 	\label{E:loReal}

624: 	\end{equation}

625: where $\delta>0$.

626: \end{example}

627:

628: The $\delta$ term in \eqref{E:loReal} only involves \emph{local interactions}, that is,

629: only terms within a fixed distance (of one) appear in the summand.  Such terms in the

630: potential are also called \emph{local interactions}.

631:

632: \begin{example}[Oligopoly With Stronger Local Competition]\label{Ex:locComp}

633: It is noted that in the infinite-agent limit, replacing $b$ with $b/2$ in \eqref{E:loReal} is

634: equivalent to changing the collusive potential component $V_c$ to the standard oligopoly

635: potential $V_o$.  We can then interpret the (infinite-agent) free energy generated by the

636: potential $V_{lo}$ to be oligopolistic competition with increased local competition.

637: \end{example}

638:

639: The potential in \eqref{E:loReal} is also generated by payoff functions

640: 	\begin{equation}

641: 	\hat{\Pi}^{lo}_k = q_k \left(

642: 	  -\frac{2b}{\np}\sum_j q_j + (a-c)

643: 		+\delta \left[ q_{k-1} +\frac{b}{\delta\np}q_k + q_{k+1} \right] ,

644: 	\right)

645: 	\label{E:loPay}

646: 	\end{equation}

647: where $q_0\equiv q_{\np+1}\equiv 0$ applies.  The dynamics generated by the

648: $\hat{\Pi}^{lo}_k$ in \eqref{E:loPay} and the $\Pi^{lo}_k$ in \eqref{E:localCPay}

649: are identical, since they generate the same potential.  Likewise, the infinite-agent

650: free energies are the same, and as argued previously, the term $bq_k/(\delta\np)$ will

651: have no effect in the infinite-agent limit.

652:

653: What is interesting about \eqref{E:loPay} is that an inverse demand function appears:

654: 	\begin{equation}

655: 	p_k(q_1,\dots,q_\np) = p(Q)

656: 				+\delta\left[ q_{k-1} +\frac{b}{\delta\np}q_k + q_{k+1} \right] ,

657: 	\label{E:locDemand}

658: 	\end{equation}

659: where $p(Q)$ is a standard linear inverse demand as in \eqref{E:linearDemand}.

660: The remaining term only involves terms local to agent $k$, and is called a

661: \emph{local (inverse) demand} function.  Also notice that payoffs cannot

662: be described by a single global demand $p(Q)$, and each agent has her own demand function

663: $p_k$.

664:

665: The reason for the consideration of the addition of local demand as with \eqref{E:loReal}

666: is that a phase transition is possible in contrast to the lack of one for the standard

667: Cournot oligopoly.  This will be shown in \secRef{S:MG} for a discrete model which is

668: a version of the Cournot model with local demand.

669:

670: Finally, we consider a situation where local demand for an agent's good is less affected

671: by more distant firms.

672:

673: \begin{example}[Power-Law Falloff of Local Demand]\label{Ex:localDemandPower}

674: Imagine agents residing at integer coordinates in a box in $d$-dimensional space; call

675: this set $Q\subset\mathbb{Z}^d$.

676: Consider the local inverse demand functions

677: 	\begin{equation}

678: 	p_k(q_1,\dots,q_\np) =

679: 	- 2 b \sum_{j\in Q:j\not=k} \frac{q_j}{|k-j|^w} + (a-c)

680: 	+ 2 \delta\sum_{i:|i-k|=1} q_i,

681: 	\label{E:powerDemand}

682: 	\end{equation}

683: where $|i-j|$ is the distance between agents $i$ and $j$, and

684: $w\geq 1$.  Here, the $\delta$-term again represents heightened local competition,

685: but the $b$-term is in a different form.  The output of agents more distant from agent $k$

686: has less of an effect on the inverse demand of agent $k$.  This can be due to a variety of

687: issues.  Local agents may have more of a local market share because more distant agents could

688: have to pay more for shipping, which can increase the price of their good in more distant

689: regions.  A variation of that is the good is primarily bought and not shipped, with customers

690: preferring to travel less distance to purchase the good.  This is in contrast to the previous

691: term $p(Q)$ in \eqref{E:linearDemand} which assumes goods are circulated uniformly in a

692: global market, with no barriers that would put more emphasis on local output.

693:

694: A potential for payoffs $\Pi_k=q_k p_k$ is

695: 	\begin{equation}

696: 	V_w =

697: 	-b \sum_{(i,j)} \frac{q_i q_j}{|i-j|^w} + (a-c)\sum q_i

698: 		  + \delta \sum_{\br{i,j}} q_i q_j,

699: 	\label{E:powerPot}

700: 	\end{equation}

701: where $(i,j)$ indicates a sum over distinct pairs of sites in $Q$ and

702: $\br{i,j}$ indicates a sum over nearest neighbors $|i-j|=1$.

703: \end{example}

704:

705:

706: It has been shown that a very interesting phase transition occurs for a discrete version

707: of \eqref{E:powerPot} above, which will be outlined in \secRef{S:MG}.

708: This phase transition is very distinct from the phase transition

709: speculated for the discrete version with the standard $p(Q)$ term.

710: We then see that an interesting feature of the Gibbs model of Cournot competition is

711: that it distinguishes between models with uniform and nonuniform distribution of

712: goods for certain ranges of parameters $\beta$, $b$, and $\delta$.

713:

714:

715: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

716: %

717: %

718: %

719: %  SECTION: Examples

720: %

721: %

722: %

723: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

724:

725:

726: \section{Example: Gibbsian Minority Game} \label{S:MG}

727:

728: The inductive minority game (see, for example, \cite{CZ, MaC}) is a version of the El Farol bar

729: problem \cite{A, MaCZ} that was introduced to analyze cooperative behavior in markets.  It is

730: introduced in the context of inductive reasoning and non-equilibrium statistical mechanics, but

731: we will look at it in another way since the utility functions can be derived from a potential.

732: An interesting point is that the dynamical equations in the inductive reasoning view do not

733: satisfy detailed balance and there is a dynamical phase transition separating an ergodic from a

734: non-ergodic process (c.f. the discussion in \cite{C}).  Similarly, there are Glauber dynamics

735: for the Ising model which use sequential site updating and do not in general satisfy detailed

736: balance, but nevertheless do converge to the appropriate Gibbs measure by stationarity (see

737: \cite{So} for an example).  The lack of detailed balance usually suggest a study of dynamics,

738: but the advantage of a \gtApproach approach is that the dynamics yield an explicit

739: stationary equilibrium state.

740:

741: In equilibrium analysis, many models exhibit a unique translation invariant infinite-volume

742: state (hence ergodicity) at high temperature, while at lower temperatures there are multiple

743: states and non-ergodic translation invariant limiting states exist (c.f. \cite{Si} for

744: details).  As such it seems reasonable to consider a \gtApproach model of the minority game.

745: First, for the sake of background, we will introduce the inductive reasoning form of the game

746: as presented in \cite{C}, and then contrast it with the Gibbsian version.

747:

748: The inductive game consists of $\np$ agents, labeled $i=1,\dots,\np$.  For simplicity, we can

749: picture these agents lined up at integers on a one-dimensional axis, hence the agents are

750: situated at sites $\vol\subset\mathbb{Z}$.

751: 		\footnote{

752: 		Because of the form of the potential, this model is not a $d$-dimensional

753: 		lattice model for any $d$.  We have chosen to use $d=1$ for visualization, and

754: 		later, a true $d$-dimensional term will be added to the potential.

755: 		}

756: At each discrete time step $t$, each agent $i$ must make a decision $\sigma_i\in\{-1,1\}$ (such

757: as to `buy' or `sell').  The agents decide to buy or sell based on public information $I(t)$

758: that is available to all agents at time $t$ (c.f. the discussion in \cite{C}).  Note that at a

759: given time, all agents receive the identical piece of information.  Examples of such

760: information could be the state of the market (decisions off all agents, the decisions of agents

761: on odd sites, the block/regional-averages of decisions on blocks of length $\np/100$, etc.),

762: the weather forecasts, change in government regulations, etc.

763:

764: Profit, or utility, is made at time step $t$ by agents according to the formula

765: 	\begin{equation}

766: 	u_i(t) = \frac{-b}{\np}\sigma_i(t) \sum_{i=1}^\np \sigma_j(t)

767: 	\label{E:utility}

768: 	\end{equation}

769: with $b>0$.  Because the coefficient $-b<0$, the profit $u_i(t)\geq 0$ when

770: $\sigma_i(t)=-\sgn\left[\sum_{i=1}^{\np} \sigma_j(t)\right]$.  The justification is that in a

771: market, if the majority is selling it is profitable to buy and vice-versa.  Further notice the

772: payoff function is scaled by $1/\np$.  It is mentioned in \cite{C} to exist for mathematical

773: convenience, but we will see later that it is needed for stability of the thermodynamic limit.

774: Economically it makes sense to scale this way so that the inherent inverse demand function

775: stays finite (see \secRef{S:Cournot}).

776:

777: Now the element of public information will be introduced.  It is noted in \cite{MaC} that a

778: phase transition occurs without the element of information, and all that is needed is

779: stochastic mixed strategies.  In \cite{C}, information is randomly drawn at time $t$ from a set

780: $\{I_1,\dots,I_p\}$.  For example, at time $t$ a number $\mu(t)\in\{1,\dots,p\}$ is drawn

781: randomly and the information $I_{\mu(t)}$ is given to all agents, so that $I(t)=I_{\mu(t)}$

782: with $1\leq\mu(t)\leq p$.

783:

784: The information $I(t)$ is converted into a configuration of $\np$ trading decisions

785: $\cfg(t)=(\sigma_1(t),\dots,\sigma_\np(t))$.  These are determined using `response strategies',

786: which are look-up tables or functions

787: $\mathfrak{R}_{i,\strat}=(R^1_{i,\strat},\dots,R^p_{i,\strat})\in\{-1,1\}^p$.

788: Each agent $i$ has $\strats$ decision making strategies $\mathfrak{R}_{i,\strat}$

789: with $\strat\in\{1,\dots,\strats\}$.  The strategy that each agent $i$ plays depends

790: on the public information $I(t)$.  If $I(t)=I_{\mu(t)}$, agent $i$ will choose a strategy

791: $\strat$ and then lookup the decision $R^{\mu(t)}_{i,\strat}=\pm1$ in the vector

792: $\mathfrak{R}_{i,\strat}$.  Agent $i$ will then make the decision

793: $\sigma_i(t)=R^{\mu(t)}_{i,\strat}$, which depends on her choice of strategy

794: \emph{and} public information.

795:

796: Finally, the last ingredient is for agents to evaluate strategies based on the market

797: performance of the strategies.  At time $t$, strategy $\mathfrak{R}_{i,\strat}$ is

798: profitable if and only if

799: $R^{\mu(t)}_{i,\strat}=-\sgn\left[\sum_{i=1}^{\np} \sigma_j(t)\right]$.  Each agent $i$

800: will then measure the cumulative performance of each strategy $\mathfrak{R}_{i,\strat}$,

801: $1\leq\strat\leq\strats$ by a profit-indicator $p_{i,\strat}$ which is updated

802: at each time step $t$ by

803: 	\begin{equation}

804: 	p_{i,\strat}(t+1)=p_{i,\strat}(t) -

805: 	  \frac{b}{\np} R^{\mu(t)}_{i,\strat} \sum_j \sigma_j(t).

806: 	\label{E:strategyEval}

807: 	\end{equation}

808: Note that at time $t$, all strategies are updated for each agent.  Likewise note that the above

809: is an evaluation scheme of strategies, and the strategies $R$ are coupled to the actual

810: decision $\sigma_i(t)$ (since $\sigma_i(t)$ appears in the sum).  Additionally, these

811: hypothetical decisions are being tracked in time, even though some aren't actually made.  At

812: time $t$, each agent $i$ now chooses strategy $\strat_i(t)$ which has the best

813: cumulative performance $p_{i,\strat}(t)$ up to that time.

814:

815: Given the external information ${\mu(t)}$, the dynamics of the minority game are given by

816: 	\begin{equation}

817: 	\begin{aligned}

818: 	p_{i,\strat}(t+1) &= p_{i,\strat}(t) -

819: 	  \frac{b}{\np} R^{\mu(t)}_{i,\strat} A(t),

820: 	\\

821: 	A(t) &= \sum_{j} R^{\mu(t)}_{j,\strat},

822: 	\\

823: 	\strat_i(t) &= \argmax_{ \strat \in \{1,\dots,\strats\} }

824: 					p_{i,\strat}(t).

825:  	\end{aligned}

826: 	\label{E:inductiveMG}

827: 	\end{equation}

828:

829: Some interesting observations about the above approach need be made.  Firstly, a phase

830: transition occurs in the above inductive model even though the model is a totally frustrated

831: one.  The fact that something stochastic is what introduces the phase transition makes sense

832: since this is what happens in the setting of spin glasses and replica symmetry breaking (c.f.

833: \cite{C, MaC2, BCP}).  It has also been shown in \cite{LVS, PSBS} that the inductive reasoning

834: model seems insensitive to the types of long range interactions that may occur.

835: Speculatively, it seems that the global nature of public information can outweigh the

836: interactions among agents, just as a global magnetic field can alter the

837: susceptibility/volatility in a fully frustrated model.  However, there are some fundamental

838: physical and economical differences between the long-range interactions presented in

839: \cite{LVS, PSBS}.  The economical differences were described in \secRef{S:Cournot} as local

840: versus global inverse demand and/or availability of goods.  The physical differences between

841: such interactions are significant, as will be shown below.

842:

843:

844: This motivates us to study the minority game in a different light than the inductive approach

845: above.  Now we will introduce the \gtApproach model of the minority based on statistical

846: observation of deductive reasoning and show how this model distinguishes local and global

847: demand.  The \gtApproach approach is not inductive: only the values of the agents' decisions

848: are considered.  Which inductive strategy an agent may be using is irrelevant.  In other words,

849: we do not distinguish inductive strategies $\mathfrak{R}^{i,\strat}$ that an agent may

850: be considering, but only look at her final decision $\sigma_i$ to buy or sell.

851:

852:

853: In the simplest \gtApproach model, there is no phase transition, but the infinite-volume

854: volatility (c.f. \appRef{S:appendixB} for the continuum version) does display nonrandom

855: behavior at finite

856: temperature.  There are phase transitions in the \gtApproach model if local interactions among

857: agents are added which represent greater competition among local firms.  This will be pursued

858: below.

859:

860:

861: Notice that the utility functions in \eqref{E:utility} can be deduced in the sense of \eqref{E:potentialCond} from the potential

862: 	\begin{equation}

863: 	V(\cfg) = \frac{-b}{\np} \sum_{i,j} \sigma_i \sigma_j,

864: 	\label{E:mgPotential}

865:       \end{equation}

866: where $b>0$ and there are $\np$ agents.  The agents can be pictured to lie on integer points of some lattice, but dimensionality of the lattice is irrelevant since all agents interact.  If short-range perturbations are added, the dimensionality of the lattice is then relevant.

867: In the basic model presented here, the volatility is differentiable in the global field (i.e., no phase transition; see below).  However, we will momentarily digress from the basic model in ~\eqref{E:mgPotential} and look at variations in an attempt to determine what may cause a phase transition.

868:

869:

870: Consider this new model (in \eqref{E:mgPotential} below) to lie in two dimensional space, and the addition of nearest neighbor ferromagnetic/aligning interactions.  These local aligning interactions correspond to the (local) inverse demand for an agent

871: 	\begin{equation}

872: 	p_i(\cfg) = \frac{-b}{\np} \sum_j \sigma_j + \delta \sum_{j:|j-i|=1} \sigma_j,

873: 	\label{E:invDemand}

874: 	\end{equation}

875: where $b>0$ and $\delta>0$.  When $\np$ is large, the local aligning interactions having coefficient $\delta$ correspond to the $i$th agent's inverse demand function increasing in response to local supply increase (i.e. selling, or the $\sigma_j$ changing from $+1$ to $-1$ for $|j-i|=1$).  On the surface, this seems contrary to usual supply and demand, since inverse demand should be a decreasing function of supply variables $\sigma_j$.

876: 	\footnote{One interpretation in \secRef{S:Cournot} is that such aligning terms are due to

877: 		    heightened local competition.  Aligning terms result from agents subtracting the

878: 		    utility/payoff of nearest-neighbor agents from their own utility functions.}

879: Mathematically, these interactions give more probability weight in the Gibbs measure to configurations with more alignments between local agents.

880: This could be interpreted as a reward when nearest agents to do the same thing (i.e., align: all buy or all sell).  It could be a purely psychological reward for ``following the herd'', and not part of the monetary economic demand function, since utility or payoff functions need not be based on monetary payoff alone.  It could also be a form of monetary influence where agents are willing to pay other agents to go along with their ideas.  Since an agent only has a relatively small amount of money, they would only be able to payoff several agents.  However, it is important to

881: note that the first term of \eqref{E:invDemand} may penalize the agent for alignment.  Ultimately,

882: whether or not an agent benefits from aligning depends on the phase of the system.

883:

884: A potential for the model with payoffs in \eqref{E:invDemand} is

885: 	\begin{equation}

886: 	V(\cfg) = \frac{-b}{\np} \sum_{i,j} \sigma_i \sigma_j

887: 		+ \delta \sum_{\br{i,j}} \sigma_i \sigma_j,

888: 	\label{E:mgLongShort}

889: 	\end{equation}

890: where $b>0$, $\delta>0$, $i,j\in\mathbb{Z}^2$, and the last sum is over nearest neighbors

891: $|i-j|=1$.  It has been speculated that the model in \eqref{E:mgLongShort} has a transition as

892: $T=1/\beta\to 0$ from a fully frustrated state to a ferromagnetic, or fully aligned, state

893: provided $b<4\delta$ \cite{Cannas}.  If $b>4\delta$ there would likely be no transition and

894: only full frustration as with \eqref{E:mgPotential} \cite{Cannas}.  Completely aligned states

895: should occur in some cases, and there should not always be only fully frustrated states.

896:

897:

898: In the case of overhyped introductory price offerings, essentially everyone is in a frenzy to

899: buy and we would see the configuration of $\sigma_i=-1$ for all $i$.  If news got out that a

900: company lost all of its worth, everyone would sell and we would see the configuration of

901: $\sigma_i=+1$ for all $i$. The aligning interactions are the only way to support such states

902: without the addition of a field term $(a-c)\sum\sigma_i$, as the model in \eqref{E:mgPotential}

903: has no phase transition (cf. \appRef{S:appendixB}).  The addition of such a `field' term, which

904: has demand and cost components, will support such states in the absence of a phase

905: transition.  The demand/cost component should be able to support such states, since such extreme

906: market reactions result in pricing adjustments (changes in $a$).

907: However, it was shown in \secRef{S:Cournot} that the aligning

908: terms can be interpreted as increased local competition.  Such effects can become large

909: in such a frenzied environment, and large alignments may occur no matter how small the

910: demand/cost term $a-c$ may be if the temperature is low enough (i.e., agents are not

911: deviating much from ``rational'' behavior).

912:

913:

914: The model in \eqref{E:mgLongShort} is similar to the models studied and summarized in

915: \cite{MI,CGT,MM}. The salient difference is that \cite{MI,CGT,MM} study a long-range `Coulomb'

916: ($w=1$) or `dipole' ($w=3$) on the two-dimensional square lattice

917: 	\begin{equation}

918: 	V(\cfg) =  -b \sum_{i,j} \frac{\sigma_i \sigma_j}{|i-j|^w}

919: 		+ \delta \sum_{\br{i,j}} \sigma_i \sigma_j,

920: 	\label{E:mgDipoleShort}

921: 	\end{equation}

922: where the $i,j\in\mathbb{Z}^2$ are two-component vectors with integer

923: coordinates. These  models are very rich, and interesting in an economic sense: the local

924: demand functions should depend more strongly on local agents, and not depend significantly on

925: distant agents.  This suggests that output of goods local to an agent has more of an affect

926: on that agent than goods that are distant from the agent, i.e., that

927: \emph{local inverse demand}

928: (cf. \secRef{S:Cournot}) depends more on local supply.  The model in \eqref{E:mgLongShort}

929: assumes that local inverse demand is identical for each agent, and that distant goods are equally accessible to an agent as local goods.  There is a fundamental difference between these

930: interactions.  An important difference in two dimensions between \eqref{E:mgLongShort} above

931: and models with falloff potentials where $\np$ is replaced with $|i-j|^w$, $w=1,3$, is that the

932: latter models can have a phase transition to an antiferromagnetic phase at low temperatures

933: and the former model has no such phase transition and only totally frustrated ground states.

934: Antiferromagnetic phases correspond to a checkerboard of $+1$'s and $-1$'s in the plane, which

935: is ordered behavior of agents.  The agents with potential \eqref{E:mgLongShort} have no such

936: ordered behavior: $+1$'s and $-1$'s are seen equally but in a random unordered fashion,

937: which reflects that supply is equally accessible to everyone.

938:

939:

940: There are four types of phases in the dipolar model

941: (cf. \cite{CGT} for details and references).

942: The first two occur below a critical temperature $T_c(\Delta)$ that depends on

943: $\Delta=\delta/b$.

944: When $\Delta$ is smaller than $\Delta_a\approx 0.425$, the ground state is the

945: antiferromagnetic one,

946: consisting of a checkerboard of $+1$ and $-1$ values of the $\sigma$.  When $\Delta>\Delta_a$,

947: the transition is from the antiferromagnetic state to a state consisting of

948: antialigned stripes of width one, representing large areas of buyers and large areas of

949: sellers that persist in some direction.  As $\Delta$ gets larger, the stripes grow in width.

950: For example, stripes of widths two, three, and four appear at $\Delta_k\approx 1.26,2.2,2.8$,

951: respectively.

952:

953: The other two phases occur when $T\geq T_c(\Delta)$.

954: There is a transition to a disordered \emph{tetragonal} phase above and near the

955: $T_c(\Delta)$, and this phase is not paramagnetic.

956: There are extended ferromagnetic domains characterized by

957: predominantly square corners, and a typical configuration looks like a maze of $+1$ trails

958: and $-1$ walls.  For larger temperatures, the transition is to a paramagnetic phase.

959: Similar phase behavior occurs for the $w=1$ Coloumb model in \eqref{E:mgDipoleShort} \cite{MM}.

960:

961:

962: The antiferromagnetic phase is one with buyers and sellers in a checkerboard

963: pattern.  This happens when the demand $p(Q)$ is mostly due to nearest-neighbor oligopolistic

964: competition; i.e., the terms $-b\sum_{i:|i-k|=1} q_k q_i$ dominate agent $k$'s demand function.

965: When there are stripes there are large areas persisting in some direction that are buying

966: and other such areas that are selling.  In this case, agent $k$ has localized oligopolistic

967: competition in one direction and heightened local competition in the other direction.

968: As the stripes widen, there is heightened local competition within the stripes, and a localized

969: oligopolistic competition among the groups of agents in the stripes.

970: In the tetragonal phase, there are large unordered areas of buyers and sellers that look

971: like a maze.   In the paramagnetic phase, buyers and sellers tend to appear at random, and this

972: corresponds to the standard Cournot model of nonlocalized oligopolistic competition.

973:

974: It seems intuitive that \eqref{E:mgLongShort} would not display stripes, but rather be fully

975: frustrated or completely aligned; that is, if supply is globally accessible and local prices

976: are essentially the same for all agents (such as a homogeneous market with no trade barriers

977: and uniform distribution of goods)

978: then there should not be large clumps of buyers and sellers.  On the other hand, if there are

979: no local interactions as in \eqref{E:mgPotential}, then there should only be antiferromagnetic

980: states or fully frustrated states with high probability, and no large clumps of buyers or

981: sellers.

982:

983: The numerical evidence in

984: \cite{LVS, PSBS} for the inductive minority game \eqref{E:inductiveMG} suggests it is

985: insensitive to the types of long range interactions; specifically that the types in

986: \eqref{E:mgPotential} of equal interaction strength show the same behavior as the falloff

987: types in \eqref{E:mgDipoleShort} with $\delta=0$.  This is in contrast to the deductive model

988: presented here.

989:

990: The stochastic nature of public information in the inductive model does induce a phase

991: transition.  It would be interesting to determine if the deductive model presented here with

992: a random field component, which could be represented by random marginal cost fluctuations

993: (cf. \secRef{S:Cournot}), will yield a phase

994: transition.  Such random costs could, for example, represent the fees, which vary from agent

995: to agent, charged to each agent per buy/sell.  In \cite{PGGS}, the addition of demand into a

996: model of stock trade is analyzed by introducing an effective field as mentioned above.

997:

998: Evidence is found in \cite{PSBS} that the dynamics of the inductive minority game is intrinsic

999: to resource allocation.  The view of the minority game as a potential game

1000: makes clear why this is the case, in

1001: the sense that all potential games are isomorphic to congestion problems \cite{MS}, and the

1002: potential for the minority game is a specific, discrete case of a supply-and-demand potential

1003: (c.f. \secRef{S:Cournot}).

1004:

1005: %

1006: %

1007: %

1008: %

1009: %

1010: %@@@

1011: \section{Conclusion} \label{S:Conclusion}

1012: We have seen that potential games (with nice potentials), when coupled with deviations-from-rationality,

1013: result in models for which equilibrium values of all  relevant (macroscopic) quantities

1014: such as output per agent, payoff per agent, etc., are determined by averaging over a Gibbs

1015: measure with the given potential at inverse temperature $\beta$.  The introduction of

1016: temperature in economics simply corresponds to the assumption that players deviate from

1017: the purely rational behavior of producing output in the direction of maximum increasing

1018: payoff.

1019:

1020: A very nice feature of the Gibbs approach is that it is accessible and demonstrates different

1021: patterns of behavior among agents for different types of distribution of goods.  This is

1022: desirable when modeling trade barriers, shipping costs, and real-life models such as

1023: restaurants, where demand is localized.  It can also be used in conjunction with renormalization

1024: techniques, which could model large-scale behavior of `agglomerated' agents, analogous to

1025: block spins for physical systems.

1026:

1027: It would also be of interest to pursue more depth insofar as the economic implications of

1028: different types of phases for the different models presented here.  Since the author is

1029: not an economist, such an endeavor will be left to the experts.

1030:

1031:

1032: \ack{I wish to thank Alexander Chorin for exposing me to fascinating ideas, which ultimately

1033: led to this paper.}

1034:

1035:

1036:

1037:

1038: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

1039: %

1040: %

1041: %

1042: %  APPENDICES

1043: %

1044: %

1045: %

1046: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

1047: \appendix

1048:

1049: \newpage

1050: %

1051: %

1052: % Fokker-Planck, Gibbs measure

1053: %

1054: %

1055: \section{Proof of Vector Fokker-Planck Equation} \label{S:appendixA}

1056: 	Here we will show the derivation of a vector Fokker-Planck equation for the joint distribution given the Langevin dynamics in \cite{AGH1}.  This proof is just a slight modification of the proof in \cite{AGH1}.  We only consider a finite number of agents $\np<\infty$.

1057:

1058: In a potential game, the It\^o equation for dynamics is $d\vx = \grad{V}dt + \sigma d\vw(t)$ as in \eqref{E:Langevin}.  For a small change in time $\Delta t$, the dynamics can be written

1059: 	\begin{equation}

1060: 	\Delta\vx(t) \equiv \vx(t+\Delta t) - \vx(t)

1061: 		= \grad{V}\Delta t + \sigma\Delta\vw(t) + \vec{o}(\Delta t),

1062: 	\label{E:langevinFinite}

1063: 	\end{equation}

1064: with $\sigma\Delta\vw(t)$ a normal random variable of mean zero and variance $\sigma^2\Delta t$, and $\vec{o}(\Delta t)$ is the usual vector `little order' notation.

1065: 	\footnote{

1066: 		    $\vec{v}(\Delta t)$ is $\vec{o}(\Delta t)$ if

1067: 		    $\lim_{\Delta t\to 0} v_i(\Delta t)/(\Delta t) = 0$ for each component

1068: 		    $v_i$ of $\vec{v}$.}

1069: The configuration of decisions $\vx(t)$ at time $t$ is hence a random variable with time-dependent joint density $f(\vx,t)$.  Let $h(\vx):\Omega\to(-\infty,\infty)$ be an arbitrary twice differentiable function such that $h$ and $\grad h$ vanish on the boundary $\partial\Omega$ of decision space $\Omega$.  Note $h$ does not explicitly depend on time.  Then the expected value of $h(\vx)$ (over phase space) at time $t+\Delta t$ is

1070: 	\begin{equation}

1071: 	E\left[ h( \vx(t+\Delta t) )\right] = \int_\Omega h(\vx)f(\vx,t+\Delta t) d\vx,

1072: 	\label{E:expectedh}

1073: 	\end{equation}

1074: where $d\vx$ is the product measure $dx_1dx_2\cdots dx_\np$ on $\Omega$.

1075: Note above that $\vx(t+\Delta t)$ on the left side represents the random variable over which the expected value $E$ is to be taken

1076: 	\footnote{

1077: 		    Rigorously, the map $\vx(t)\to \vx(t+\Delta t)$ generates a differentiable global

1078: 		    (stochastic) flow on decision space $\Omega$ for nice potentials, and the global flow has

1079: 		    a generator which will be shown below.},

1080: whereas the $\vx$'s on the right side are the points in $\Omega$.  Equation \eqref{E:langevinFinite} can be used to obtain another expression for \eqref{E:expectedh}

1081: 	\footnote{

1082: 		    For simplicity, we use the formal notation of differential  stochastic calculus.}:

1083: 	\begin{equation}

1084: 	E\left[ h( \vx(t+\Delta t) )\right] = E\left[ h( \vx(t)+\Delta x(t) ) \right]

1085: 	= E\left[ h(  \vx(t)+ \grad{V}(\vx(t))\Delta t + \sigma\Delta\vw(t) ) \right]

1086: 	+\vec{o}(\Delta t).

1087: 	\label{E:expectedh2}

1088: 	\end{equation}

1089: The expression on the right hand side of \eqref{E:expectedh} will be subtracted from the Taylor expansion of the right side of \eqref{E:expectedh2} to yield the Fokker-Planck equation.  We will proceed as follows.

1090:

1091: Let $g(\vy)$ be the (joint) density of $\sigma\Delta\vw(t)$, which is an $\np$-dimensional normal density with mean zero and variance $\sigma^2\Delta t$.  The left side of \eqref{E:expectedh2} is in terms of the random variables $\vx(t+\Delta t)$, and the right side is in terms of the variables $\vx(t)$ and $\vy$ with respective densities $f(\vx,t)$ and $g(\vy)$.  Accordingly, the right side of \eqref{E:expectedh2} can be written as an integral over these densities:

1092: 	\begin{equation}

1093: 	E\left[ h( \vx(t+\Delta t) )\right] =

1094: 	\int_{\mathbb{R}^n}\int_\Omega

1095: 	  h\left( \vx(t) + \grad{V}(\vx(t))\Delta t + \sigma\vy \right)

1096: 	f(\vx,t) g(\vy) d\vx d\vy,

1097: 	\label{E:expectedh3}

1098: 	\end{equation}

1099: where $\mathbb{R}=(-\infty,\infty)$.

1100:

1101: The Taylor expansion of the right side of \eqref{E:expectedh3} yields

1102: 	\begin{equation}

1103: 	\begin{aligned}

1104: 	\int_{\mathbb{R}^n} \int_\Omega

1105: 	&\Big\{

1106: 	 h(\vx(t)) + \grad{h}(\vx(t))\cdot[\grad{V}(\vx(t))\Delta t + \vy]

1107: 	\\

1108: 	 &\:+ \frac{1}{2}[\grad{V}(\vx(t))\Delta t + \vy]\cdot

1109: 		D^2h(\vx(t)) \cdot[\grad{V}(\vx(t))\Delta t + \vy]^T + \vec{o}(\Delta t)

1110: 	\Big\}

1111: 	f(\vx,t)g(\vw)d\vx \, d\vy,

1112: 	\label{E:TaylorE}

1113: 	\end{aligned}

1114: 	\end{equation}

1115: where $D^2h$ is the second-derivative matrix for $h$ (i.e., the $ij$ entry is $\partial^2h/\partial x_i\partial x_j$) and the transpose of a row vector $\vec{v}$ is the column vector $\vec{v}\,^T$.

1116:

1117: Integrating \eqref{E:TaylorE} over the $\vy$ eliminates terms linear in $y_i$ as well as the mixed second-order terms since the $y_i$ have mean zero and are independent.  The expected value of $w_i^2$ is $\sigma^2\Delta t$, hence \eqref{E:TaylorE} reduces to

1118: 	\begin{equation}

1119: 	\begin{aligned}

1120: 	\int_\Omega h(\vx(t)) f(\vx,t)dx

1121: 	&+ \Delta t\int_\Omega \grad{h}(\vx(t))\cdot\grad{V}(\vx(t)) f(\vx,t) d\vx

1122: 	\\

1123: 	&+ \Delta t \frac{\sigma^2}{2} \int_\Omega \nabla^2 h(\vx(t))  f(\vx,t) d\vx

1124: 	+ o(\Delta t),

1125: 	\end{aligned}

1126: 	\label{E:TaylorEInty}

1127: 	\end{equation}

1128: where $o(\Delta t)$ is the one-component version of $\vec{o}(\Delta t)$ and $\nabla^2$ is the Laplacian.  Thus the generator for the Langevin dynamics $\vx(t+t_0)=\mathfrak{L}^t \vx(t_0)$ is $\mathbb{L}=(\grad V)\cdot\grad + (\sigma^2/2) \nabla^2$, where $\mathbb{L}$ acts on twice-differentiable functions on $\Omega$ that vanish on the boundary.  It is easy to show $\mathbb{L}$ is symmetric and the domain of its adjoint is the Sobolev space $H^2(\Omega)$ for a well-behaved potential $V$.  We view its adjoint $\adj{\mathbb{L}}$ as a densely defined unbounded operator in $L^2(\Omega,d\vx)$.  The idea is apparent from the following.

1129: % see ``applied analysis'' p.267

1130:

1131:

1132: Using Green's identities, the integrals containing the terms $\grad h$ and $\nabla^2 h$ can be integrated by parts, and the boundary integrals vanish on the boundary $\partial\Omega$ (since $h$ and $\grad h$ do) leaving a new expression for \eqref{E:TaylorEInty}:

1133: 	\begin{equation}

1134: 	\begin{aligned}

1135: 	\int_\Omega h(\vx(t)) f(\vx,t)dx

1136: 	&- \Delta t\int_\Omega h(\vx(t)) \grad\cdot[\grad{V}(\vx(t)) f(\vx,t)] d\vx

1137: 	\\

1138: 	&+ \Delta t \frac{\sigma^2}{2} \int_\Omega h(\vx(t)) \nabla^2 f(\vx,t) d\vx

1139: 	+ o(\Delta t).

1140: 	\end{aligned}

1141: 	\label{E:TaylorEInty2}

1142: 	\end{equation}

1143: This shows that the adjoint of $\mathbb{L}$ operates as $\mathbb{L}^*(f)=-\grad\cdot(f \grad V) + (\sigma^2/2) \nabla^2 f$.  Equations \eqref{E:expectedh} and \eqref{E:TaylorEInty2} are identical, hence subtracting them yields

1144: 	\begin{equation}

1145: 	\begin{aligned}

1146: 	\int_\Omega h(\vx(t)) [f(\vx,t+\Delta t) - f(\vx,t)]dx =

1147: 	\Delta t\int_\Omega h(\vx(t)) [-\grad\cdot[\grad{V}(\vx(t)) f(\vx,t)]

1148: 		+ \frac{\sigma^2}{2} \nabla^2 f(\vx,t) ] d\vx

1149: 	+ o(\Delta t).

1150: 	\end{aligned}

1151: 	\label{E:FPInt}

1152: 	\end{equation}

1153: Dividing \eqref{E:FPInt} by $\Delta t$, taking the limit $\Delta t\to 0$ results in the Fokker-Planck equation \eqref{E:FokkerPlanck} since the set of all such $h$'s described are a separating set for probability densities on $\Omega$ (i.e., these $h$'s can weakly approach delta functions on $\Omega$).

1154:

1155: Let $h(\vx)$ be an observable (i.e. function) on $\Omega$ that is differentiable, vanishes on $\partial\Omega$, and doesn't depend explicitly on time.  For a stationary state $f$, the average of $h(\vx)$ is constant in time:

1156:  	\begin{equation}

1157: 	\begin{aligned}

1158: 	0 &= \frac{\partial}{\partial t} \int_\Omega h(\vx) f(\vx,t) d\vx

1159: 	\\

1160: 	  &= \int_\Omega h(\vx) \grad\cdot\left\{-[\grad{V}(\vx(t)) f(\vx,t)]

1161: 				+ \frac{\sigma^2}{2} \grad f(\vx,t)\right\}

1162: 	\\

1163: 	  &= -\int_{\Omega} \grad h(\vx) \cdot \left\{-[\grad{V}(\vx(t)) f(\vx,t)]

1164: 				+ \frac{\sigma^2}{2} \grad f(\vx,t)\right\}.

1165: 	\end{aligned}

1166: 	\label{E:FokkerPlanck2}

1167: 	\end{equation}

1168: Since $h$ was arbitrary and $\grad h$ can be made to approach delta functions, for a stationary state

1169: 	\begin{equation}

1170: 	-[\grad{V}(\vx(t)) f(\vx,t)] + \frac{\sigma^2}{2} \grad f(\vx,t) = 0.

1171: 	\label{E:statState}

1172: 	\end{equation}

1173: We then see that the Gibbs state in \eqref{E:Gibbs}

1174: is the unique solution to \eqref{E:statState}, and is the equilibrium state for the potential game.

1175: The negative of the Helmholtz free energy is in fact a Liapunov function for the dynamics and it can be used to show (see \cite{AGH1}) that the unique finite-agent solution \eqRef{E:Gibbs} of \eqRef{E:statState} is attained in the long run.

1176:

1177:

1178:

1179:

1180:

1181:

1182:

1183:

1184:

1185:

1186:

1187:

1188: \newpage

1189:

1190: %

1191: %

1192: %  APPENDIX

1193: %

1194: %

1195: % Partition function solution and magnetization

1196: %

1197: %

1198: \section{Solution of the Cournot Free Energy} \label{S:appendixB}

1199: The free energy for the Cournot oligopoly and collusion is obtained from the partition function in \eqRef{E:partitionCO}

1200: 	\begin{equation}

1201: 	\mathcal{Z}_{\np}\!\!\left(\tilde{b}\right) = \int \e^{\beta V}

1202: 	= \int_{\qmin}^{\qmax}

1203: 	  \exp\left[ -\beta\frac{b}{\np} \sum_{j,k=1}^{\np} \tilde{q}_j \tilde{q}_k

1204: 		-\beta\frac{\tilde{b}}{\np}\sum_{j=1}^{\np} \tilde{q}_j^2

1205: 		+ \beta(a-c)\sum_{j=1}^{\np} \tilde{q}_j \right]

1206: 	  \prod_{j=1}^{\np} d\tilde{q}_j.

1207: 	\label{E:partitionCO2}

1208: 	\end{equation}

1209: The limit of the free energy

1210: 	\begin{equation}

1211:       F_{\np} \left(\tilde{b}\right)=\frac{1}{\beta\np}

1212: 		\ln\!\left(\mathcal{Z}_{\np}\!\!\left(\tilde{b}\right)\right)

1213: 	\label{E:freeEnergy}

1214: 	\end{equation}

1215: will be shown below.  First note that the limit does not depend on the value of $\tilde{b}$, since

1216: 	\begin{equation}

1217: 	-\frac{\tilde{b}\qmax^2}{n} + F_{\np}

1218: 	\leq

1219: 	F_{\np} \left(\tilde{b}\right)

1220: 	\leq

1221: 	-\frac{\tilde{b}\qmin^2}{n} + F_{\np},

1222: 	\end{equation}

1223: where $F_{\np}\equiv F_{\np}(0)$.  It then suffices to set $\tilde{b}$ to zero in \eqRef{E:partitionCO2}, which will be done below to simplify a limit.

1224: For further manipulation, changing variables to $q=\tilde{q}-\gamma$, $\gamma\equiv(\qmax+\qmin)/2$, $h\equiv a-c$, results in

1225: 	\begin{equation}

1226: 	\begin{aligned}

1227: 	\mathcal{Z}_{\np}

1228: 	= &\exp\left[ -\beta b\gamma^2 n - \beta\tilde{b}\gamma^2+ \beta h\gamma n

1229: 			  + \beta \frac{n}{b} \left(

1230: 			       \gamma b + \gamma \frac{\tilde{b}}{\np} - \frac{h}{2}

1231: 			    \right)^2

1232: 	     \right]

1233: 	\\

1234: 	&\;\times

1235: 	\int_{[-Q/2,Q/2]^\np} \exp\left[

1236: 	  -\beta \left( \sqrt{\frac{b}{\np}} \sum q_j

1237: 	  + \sqrt{\frac{\np}{b}} \left( b\gamma + \frac{\tilde{b}}{\np}\gamma - \frac{h}{2} \right)

1238: 		\right)^2

1239: 	  -\beta\frac{\tilde{b}}{\np}\sum q_j^2 \right]

1240: 	  \prod_{j=1}^{\np} dq_j,

1241: 	\end{aligned}

1242: 	\label{E:partitionCO3}

1243: 	\end{equation}

1244: where $Q\equiv\qmax-\qmin$.

1245:

1246: The solution can be found using a saddle point method, and the key is to use the identity

1247: 	\begin{equation}

1248: 	\e^{-\beta p^2} = \frac{1}{ \sqrt{\pi}\sqrt{4\beta} }

1249: 	\int_{-\infty}^{\infty}

1250: 	   \exp\left( -\frac{1}{4\beta} t^2 + ipt \right) dt

1251: 	\label{E:approx}

1252: 	\end{equation}

1253: as with the simplest case in \cite{K}.

1254: Using the above in \eqref{E:partitionCO3} and setting $\eta\equiv b\gamma + \tilde{b}\gamma/n - h/2$ results in

1255: 	\begin{equation}

1256: 	\begin{aligned}

1257: 	\mathcal{Z}_\np

1258: 	= \frac{1}{ \sqrt{\pi} \sqrt{4\beta} }

1259: 		&\exp\left[ \frac{\beta h^2 \np}{4b} + \beta \tilde{b} \gamma \left(

1260: 			\frac{ \tilde{b}\gamma }{ b \np} + \gamma  - \frac{h}{b} \right)

1261: 	     \right]

1262: 	\int_{-\infty}^{\infty}

1263: 		\exp\left[\frac{-1}{4\beta}t^2

1264: 			+ i\sqrt{\frac{\np}{b}} \eta t

1265: 		\right]

1266: 	\\

1267: 	&\;\times

1268: 		\left\{ \int_{-Q/2}^{Q/2}

1269: 	  \exp\left[-\beta \frac{\tilde{b}}{\np} q^2 \right]

1270: 	  \exp\left[ i \sqrt{\frac{b}{\np}}q t \right]

1271: 	  dq

1272: 	\right\}^\np dt,

1273: 	\end{aligned}

1274: 	\label{E:par2}

1275: 	\end{equation}

1276: where the order of integration was changed via Fubini's theorem.  Consider part of the integrand above

1277: 	\begin{equation}

1278: 	g_{\np}\!\left( \sqrt{\frac{b}{\np}}\,t \right)

1279: 	=

1280: 	\frac{1}{Q} \int_{-Q/2}^{Q/2}

1281: 		\exp\left[-\beta \frac{\tilde{b}}{\np} q^2 \right]

1282: 	  	\exp\left[ i q \sqrt{\frac{b}{\np}}t \right]

1283: 	dq

1284: 	= \sinc\!\!\left( \frac{Q}{2}\sqrt{\frac{b}{\np}}\,t \right),

1285: 	\label{E:g}

1286: 	\end{equation}

1287: where $\sinc(x)\equiv\sin(x)/x$ ($\sinc(0)\equiv 1$), $\tilde{b}$ was explicitly set to zero, and a normalizing factor was added so that $g_t(0)=1$.

1288: Changing variables to $\hat{t}=t/\sqrt{n}$, the partition function can then be written

1289: 	\begin{equation}

1290: 	\begin{aligned}

1291: 	\mathcal{Z}_{\np}

1292: 	=

1293: 	&\exp\left[-\beta\gamma(b\gamma - h)\np\right]

1294: 	\frac{ Q^{\np} \sqrt{\np} }{ \sqrt{\pi}\sqrt{4\beta} }

1295: 	\\

1296: 	\times

1297: 	  &\;\int_{-\infty}^{\infty}

1298: 		\left\{

1299: 		\exp \left[ -\frac{1}{4\beta}

1300: 		 \left( \hat{t} - i\frac{2\beta}{\sqrt{b}} \eta \right)^2

1301: 		\right]

1302: 		\sinc\!\!\left( \frac{Q}{2}\sqrt{b}\,\hat{t} \right)

1303: 		\right\}^{\np}

1304: 		\,d\hat{t}.

1305: 	\end{aligned}

1306: 	\label{E:par3}

1307: 	\end{equation}

1308: Now a basic fact is needed to determine the limit of the free energy.

1309:

1310:

1311: %

1312: %

1313: % saddle point integral Lemma

1314: %

1315: %

1316: \begin{lemma}\label{L:converge}

1317: Let $f(x)$ be a bounded, continuous, real-valued, integrable function on the real line

1318: $\mathbb{R}$ and let $G$ be an open set.  Suppose there is a point $x_0\in G$ for which

1319: $|f(x)|<f(x_0)$ for all $x\not= x_0$ and $\limsup|f(x)|<f(x_0)$ for all $x$ outside of

1320: some neighborhood of $x_0$ contained in $G$.  Then

1321: 	\begin{equation}

1322: 	\lim_{m\to\infty}

1323: 	  \left\{  \int_G \left[ f(x) \right]^{m+k} \,dx

1324: 	  \right\}^{1/m}

1325: 	= f(x_0)

1326: 	\label{E:fnLim}

1327: 	\end{equation}

1328: for any fixed integer $k$.

1329: Furthermore, for a function $g(x)$ that is continuous at $x_0$ with $gf^k\in L^\infty$,

1330: 	\begin{equation}

1331: 	\lim_{m\to\infty}

1332: 	\frac{\int_{-\infty}^{\infty} g(x) \left[ f(x) \right]^m \,dx

1333: 	    }{\int_{-\infty}^{\infty} \left[ f(x) \right]^m \,dx}

1334: 	= g(x_0).

1335: 	\label{E:measureLim}

1336: 	\end{equation}

1337: \end{lemma}

1338: \begin{proof}

1339: First, by factoring $\|f\|_{\infty}$ out of the integral in \eqref{E:fnLim}, we may

1340: assume $f(x_0)=1$

1341: by considering the function $f/\|f\|_{\infty}$.  Choose a number $\epsilon_0>0$ so that

1342: $\limsup|f(x)|<1-2\epsilon_0$ for all $x$ outside of a neighborhood

1343: $\mathcal{N}\subset G$ of $x_0$ and $f>0$ on $\mathcal{N}$.

1344: Let $S_{1-\epsilon_0}=\left\{ x\in G | f(x)\geq 1-\epsilon_0 \right\}$, and

1345: $I_m=( \int_G [ f(x) ]^{m+k} \,dx )^{1/m}$.

1346: Since $I_m \leq f(x_0)^{(m+k-2)/m} ( \int_G f^2(x)\,dx )^{1/m}$ for large $m$,

1347: $\limsup_{m\to\infty} I_m \leq f(x_0)$.  The inequality

1348: 	\begin{equation}

1349: 	(I_m)^m \geq (1-\epsilon)^{m+k-2} \int_{S_{1-\epsilon}} f^2(x)\,dx

1350: 		- (1-2\epsilon_0)^{m+k-2} \int_{G-\mathcal{N}} f^2(x)\, dx

1351: 	\end{equation}

1352: holds for all $\epsilon\leq\epsilon_0$.  Upon factoring out $(1-\epsilon)^{m+k-2}$,

1353: it is seen that $\liminf_{m\to\infty} I_m\geq f(x_0)$.

1354: To prove \eqref{E:measureLim}, the following is needed:

1355: 	\begin{equation}

1356: 	\lim_{m\to\infty}

1357: 	\left|\frac{\int_{\mathbb{R}-S_{1-\epsilon}} g(x) \left[ f(x) \right]^m \,dx

1358: 	          }{\int_G \left[ f(x) \right]^m \,dx}

1359: 	\right|

1360: 	\leq

1361: 	\|gf^k\|_\infty  \lim_{m\to\infty}

1362: 	\frac{\int_{\mathbb{R}-S_{1-\epsilon}} |f(x)|^{m-k} \,dx

1363: 	    }{\left|\int_G \left[ f(x) \right]^m \,dx\right|}

1364: 	=0,

1365: 	\label{E:meas1}

1366: 	\end{equation}

1367: where the last equality follows from the root test for convergence since the

1368: hypothesis implies the limit of the ratio on the right side of \eqref{E:meas1}

1369: to the power $1/m$ is less than one.  With this, \eqref{E:measureLim} reduces to

1370: 	\begin{equation}

1371: 	\frac{\int_{S_{1-\epsilon}} g(x)[f(x)]^m\,dx

1372: 	    }{\int_{\mathbb{R}} [f(x)]^m\,dx}.

1373: 	\end{equation}

1374: By the continuity of $g$ and \eqref{E:meas1} again with $G=S_{1-\epsilon}$, it is seen

1375: that \eqref{E:measureLim} holds.

1376: \end{proof}

1377:

1378: To apply \lemRef{L:converge} to the partition function in \eqref{E:par3}, the integration

1379: path along the real line must be deformed in the complex plane to a path on which the

1380: integrand is real-valued.  As such, consider the function

1381: $f(z)=\exp\left[-k(z +i \eta_0)^2\right]\sinc(z)$, where $\eta_0= -\beta Q\eta$ and

1382: $1/k=\beta bQ^2$ (changing variables $t=Q\sqrt{b}\,\hat{t}/2$ in \eqref{E:par3}; note

1383: $\eta_0$ is increasing in $h$).

1384:

1385:

1386:

1387:

1388: Let $z=x+iy$, and $f(z)=\mathcal{R}(x,y)+i\mathcal{I}(x,y)$, where

1389: $x,y,\mathcal{R},\mathcal{I}$ are real-valued.  Then we are interested in the paths

1390: $(x(s),y(s))$ in the complex plane $\mathbb{C}$ on which $\mathcal{I}=0$.  Since $f$ is

1391: holomorphic and not identically zero, the gradient of $\mathcal{I}$, $\grad\mathcal{I}$,

1392: can only vanish on a closed set of isolated points in $\mathbb{C}$.  Thus $\mathcal{I}^{-1}(0)$

1393: consists of various differentiable paths in $\mathbb{C}$.  The imaginary part can be

1394: expanded

1395: 	\begin{equation}

1396: 	\begin{aligned}

1397: 	\mathcal{I}(x,y) =

1398: 		\frac{ \e^{-kx^2 + k(y+\eta_0)^2} }{ x^2+y^2 }

1399: 		&\big\{ \sin(-2kx(y+\eta_0))[x\sin x \cosh y + y\cos x \sinh y]

1400: 		\\

1401: 			&+ \cos(-2kx(y+\eta_0))[-y\sin x \cosh y + x\cos x \sinh y]

1402: 		\big\}.

1403: 	\end{aligned}

1404: 	\label{E:I}

1405: 	\end{equation}

1406:

1407: %

1408: %

1409: % deform to real path Lemma

1410: %

1411: %

1412: \begin{lemma}\label{L:deform}

1413:   There is a piecewise differentiable path $z(s)$ such that $\mathcal{I}(z(s))=0$.

1414: In addition, $z(s)$ can be written $(x,y_z(x))$ for small and large $|x|$, with

1415: $\lim_{x\to\pm\infty}y_z(x)=C_1$ and $\lim_{x\to 0}y_z(x)=C_2$ for constants

1416: $C_1$ and $C_2$.

1417: \end{lemma}

1418: \begin{proof}

1419: Consider fundamental domains in the $z$-plane of $w=\sinc(z)$.

1420: The fundamental domains $D_j$, $D'_j$ shown in figure \ref{F:lnsinc} are then

1421: separated by curves satisfying $y=0$ and $\tan x/x=\tanh y/y$, with $x\not=0$.

1422: If $x>0$, the implicit function theorem implies the latter

1423: curves can be written $x=c_j(y)$ with $c_j$

1424: differentiable and $j\pi<c_j(y)<(2j+1)\pi/2$, $j=1,2,\dots$.

1425: Let $c_0(y)=0$ and for $x<0$, the curves are reflected: $c_{-j}=-c_j$.

1426: The branch cuts in the $w$-planes are then the images of $c_{2j+1}$.

1427: The logarithm $\ln(\sinc z)$ will now map two interior points in the $z$ plane

1428: to a complex number since $\sinc(-z)=\sinc(z)$.

1429: Branch cuts at $|x|>0$ are added in the $z$-plane on the $x$-axis to

1430: avoid discontinuities in $\alpha$.

1431: The fundamental domains $D_j$ lie below the $x$-axis between the curves $c_{2j-1}$

1432: and $c_{2j+1}$, whereas the $D'_j$ lie above the $x$-axis between $c_{-2j-1}$

1433: and $c_{-2j+1}$.

1434: The shaded regions of the $D_j$ and $D'_j$ in the figure below

1435: correspond to the lower half of the $w$-planes.

1436:

1437: Let $\alpha=\alpha(x,y)$ be the angle of $\sinc(z)$ (i.e., the imaginary

1438: part of $\ln(\sinc z)$).  Figure \ref{F:lnsinc} shows that in the lower half

1439: of the $z$-plane, $\alpha$ increases in $x$ and is of order

1440: $x$.

1441:

1442:

1443: % Syntax:  \centerwmf{<width>}{<height>}{<path+filename>}

1444: %     Requires "\input setwmf" at the beginning of your file.

1445: % Optional:  <path> (use / instead of \), specifies path of TeX file if not supplied.

1446: % Example:  \centerwmf{3in}{2in}{c:/mysubdir/mypic.emf}

1447: %\centerwmf{7in}{5in}{fundDomain.wmf}

1448: %\begin{figure}

1449: %\centerwmf{7in}{5in}{fundDomain.wmf}

1450: %\caption{$\ln(\sinc z)$ values}

1451: %\label{F:lnsinc}

1452: %\end{figure}

1453:

1454: \begin{figure}[h]

1455: \begin{center}

1456: \includegraphics[width=7in,height=5in]{FundDomain.ps}

1457: \caption{$\ln(\sinc z)$ values}

1458: \label{F:lnsinc}

1459: \end{center}

1460: \end{figure}

1461:

1462:

1463:

1464:

1465: Differentiating the identity for $\tan\alpha$,

1466: 	\begin{align}

1467: 	\frac{\partial\alpha}{\partial x} &=

1468: 	\frac{2y\left( \sin^2x+ \sinh^2y \right) -\left(x^2+y^2\right)\sinh2y}

1469: 		{2\left(x^2+y^2\right)\left(\sin^2x+\sinh^2y\right)}

1470: 	\label{E:alphax}

1471: 	\\

1472: 	\frac{\partial\alpha}{\partial y} &=

1473: 	\frac{-2x\left( \sin^2x+ \sinh^2y \right) +\left(x^2+y^2\right)\sin2x}

1474: 		{2\left(x^2+y^2\right)\left(\sin^2x+\sinh^2y\right)}

1475: 	\label{E:alphay}

1476: 	\end{align}

1477: and it is seen that $\frac{\partial\alpha}{\partial x}>0$ ($<0$) when $y<0$ ($>0$).

1478:

1479: The part in braces in \eqref{E:I} can be written

1480: 	\begin{equation}

1481: 	\sqrt{x^2+y^2}\sqrt{\sin^2 x + \sinh^2 y}

1482: 	\,\sin(-2kx(y+\eta_0)+\alpha),

1483: 	\label{E:Ireduced}

1484: 	\end{equation}

1485: and this is zero if and only if $\mathcal{I}$ is zero.  For the case $\eta_0=0$, the integral

1486: in \eqref{E:par3} can be evaluated using \lemRef{L:converge}, with the maximum of the integrand

1487: occuring at zero. In the case $\eta_0\not= 0$,  the paths in $\mathbb{C}$ on which $f(z)$ is

1488: real-valued is precisely where

1489: 	\begin{equation}

1490: 	-2kx(y+\eta_0)+\alpha = m\pi,

1491: 	\label{E:realF}

1492: 	\end{equation}

1493: where $m$ is an arbitrary integer.

1494:

1495: Since $y+\eta_0 = \alpha/(2kx) - m\pi/(2kx)$ and $\alpha$ is of order $x$, it is evident

1496: that $|y|\to\infty$ as $x\to 0$ on such paths unless $m=0$.  This is not surprising, since

1497: large $|z|$ behavior of $f$ is dominated by the $z^2$ term in the exponential (thus giving

1498: hyperbolae-type curves).  We will single out the $m=0$ curve $z(s)=(x(s),y(s))$ satisfying

1499: 	\begin{equation}

1500: 	y +\eta_0=\alpha(x,y)/(2kx)

1501: 	\label{E:newCurve}

1502: 	\end{equation}

1503: for reevaluating the integral in \eqref{E:par3}.

1504: Since $\alpha$ is of order $x$, the $y$-component of $z(s)$, $y(s)$, is bounded and has

1505: finite limits as $x\to\pm\infty$.  The location of the curve depends on $\eta_0$.

1506: To see this, note that $\alpha(x,y)<Dx$ for a constant $D$ with

1507: $|D|\leq 1$, and the sign of $D$ is negative in the upper-half and positive in the

1508: lower-half of the $z$-plane.  If $\eta_0>0$ and the curve $z(s)$ were in the upper-half

1509: of the $z$-plane, then $y=\alpha/(2kx)-\eta_0\leq D/(2k) - \eta_0 <0$, which is

1510: a contradiction.  Hence if $\eta_0>0$ ($<0$), then the curve $z(s)$ is in the lower (upper)

1511: half of the $z$-plane.

1512:

1513: The $y$-intercept of $z(s)$ can be found using $\lim_{x\to 0^+}\alpha=0$

1514: and $\lim_{x\to 0} \sin(\alpha)/\alpha =1$.  Near $x=0$, $z(s)$ can be written

1515: $z=(x,y_z(x))$, from which

1516: 	\begin{equation}

1517: 	\begin{aligned}

1518: 	y_z\!(0)+\eta_0 = \lim_{x\to 0} \frac{\alpha}{2kx}

1519: 	&=

1520: 	\lim_{x\to 0} \frac{-y_z \sin x \cosh y_z + x\cos x \sinh y_z}

1521: 		{ 2kx\sqrt{x^2+y_z^2}\sqrt{\sin^2 x \cosh^2 y_z + \cos^2 x \sinh^2 y_z} }

1522: 	\\

1523: 	&= \frac{-y_z\!(0)\cosh y_z\!(0) + \sinh y_z\!(0)}{2ky_z\!(0)\sinh y_z\!(0)}.

1524: 	\end{aligned}

1525: 	\label{E:curvey}

1526: 	\end{equation}

1527: The solutions of the above equation in $y_z\!(0)$ are the same as the solutions in $y$ to the equation $y/[1-2k(y+\eta_0)y]=\tanh(y)$, except for an extraneous root of $y=0$ in the latter (since $\eta_0\not= 0$).  The solution for the $y$-intercept is unique, and this will be shown

1528: later.

1529: \end{proof}

1530:

1531:

1532:

1533: With the above, the integral in \eqref{E:par3} can be deformed onto the curve $z(s)$.

1534: To apply \lemRef{L:converge},

1535: it is necessary to find the maximum of $|f|$ on the full curve where $\mathcal{I}=0$, and to

1536: also show it occurs at a single point $z_m$ such that $f(z_m)>0$.

1537:

1538: Equivalently, it will be easier to find the maximum of

1539: 	\begin{equation}

1540: 	|f|^2=\exp\left[-2kx^2+2k(y+\eta_0)^2\right]

1541: 		\left(\sin^2x\cosh^2y+\cos^2x\sinh^2y\right)/(x^2+y^2)

1542: 	\end{equation}

1543: on the full $\mathcal{I}=0$ curve $z(s)=(x(s),y(s))$.

1544: Since $\lim_{s\to\pm\infty}f(z(s))=\lim_{x\to\pm\infty}f(x,y_z(x))=0$, a maximum occurs and is greater than zero.  On the curve near this maximum, say at $s_m$, the argument of $f$ is constant (either $\Arg(f)=0$ or $\Arg(f)=\pi$) hence $\tan(\Arg(f))=\mathcal{I}/\mathcal{R}$ is constant.  Since $\tan(\Arg[f(x(s),y(s))])'=0$, the tangent vector $\vec{v}(s)$ to the curve $(x(s),y(s))$ is orthogonal to $\grad(\mathcal{I}/\mathcal{R})$ near $s_m$.

1545:

1546: Any maximum point of $|f|^2$ on the curve $z(s)$ must satisfy

1547: 	\begin{equation}

1548: 	\begin{aligned}

1549: 	0=\frac{\partial}{\partial x}|f|^2 =

1550: 			&\frac{\exp[-2x^2+2(y+\eta_0)^2]}{(x^2+y^2)^2}

1551: 		\\

1552: 		  &\times

1553: 		   \left\{ -2x(2kx^2+2ky^2+1)\left(\sin^2x\cosh^2y+\cos^2x\sinh^2y\right)

1554: 				+ (x^2+y^2)\sin2x

1555: 		   \right\}

1556: 	\end{aligned}

1557: 	\label{E:x}

1558: 	\end{equation}

1559: and

1560: 	\begin{equation}

1561: 	\begin{aligned}

1562: 		0=\frac{\partial}{\partial y}|f|^2 =

1563: 			&\frac{\exp[-2x^2+2(y+\eta_0)^2]}{(x^2+y^2)^2}

1564: 		\\

1565: 		  &\times

1566: 		   \left\{(4k(y+\eta_0)(x^2+y^2)-2y)

1567: 			\left(\sin^2x\cosh^2y+\cos^2x\sinh^2y\right)

1568: 			+ (x^2+y^2)\sinh2y

1569: 		   \right\}.

1570: 	\end{aligned}

1571: 	\label{E:y}

1572: 	\end{equation}

1573:

1574:

1575: %%%%%%%%%%%%%%%%%

1576: %

1577: % maximum point lemma

1578: %

1579: %%%%%%%%%%%%%%%%%

1580: \begin{lemma}\label{L:saddleRegion}

1581: The conditions in \eqref{E:x} and \eqref{E:y} imply that the maximum of $|f|^2$

1582: on the curve $z(s)$ occurs at $(0,y_z(0))$, the $y$-intercept of the curve $z(s)$.

1583: \end{lemma}

1584: \begin{proof}

1585: Without loss of generality it can be assumed that $\eta_0\geq0$, since the integral in \eqref{E:par3} is an even function of $\eta_0$.

1586:

1587: For the case $\eta_0=0$, the curve $z(s)$ is the $x$-axis and

1588: $|f|^2(z(s))=\e^{-2kx^2}\sinc^2x$.  Clearly $(0,0)$ is a global maximum of $|f|^2$

1589: on $z(s)$.  Hereon, the case $\eta_0>0$ is considered.

1590:

1591: We will consider the case $x\not= 0$, and $x=0$ will be dealt with subsequently.  Then \eqref{E:x} is equivalent to

1592: 	\begin{equation}

1593: 	1 > \sinc2x

1594: 	  = \frac{2kx^2 + 2ky^2 + 1}{x^2+y^2}

1595: 		(\sin^2x + \sinh^2y)

1596: 	  \geq 0.

1597: 	\label{E:xCond1}

1598: 	\end{equation}

1599:

1600: The inequality $\sinc2x\geq 0$

1601: implies $x\in[-\pi/2,\pi/2]$ or $x\in\pm[m\pi,(m+1)\pi/2]$ for positive integers $m$.

1602: In order for \eqref{E:xCond1} to hold, $\sin^2x+\sinh^2y \leq \sinc(2x)(x^2+y^2)$, which

1603: is impossible for $|x|\leq\pi/2$.  Any extreme points of $|f|^2$ for which $x\not= 0$

1604: must therefore satisfy $|x|\geq\pi$.

1605:

1606: From the above argument, maximum points of $|f|^2$ in the set

1607: $S_0=\{(x,y):|x|\leq\pi/2\}$ must satisfy $x=0$.

1608: Any maximum point $(x,y)\notin S_0$ will then satisfy $|x|\geq\pi$.

1609: It is seen from \eqref{E:newCurve} that $y+\eta_0<0$ when $x>0$ and $y<0$, which

1610: along with \eqref{E:y}, implies that $|y|>(\pi^2+y^2)\tanh|y|$ at an extreme point.

1611: That this is impossible shows the largest value of $|f|^2$ on the curve $z(s)$

1612: will occur at $x=0$.

1613:

1614: It only remains to show the intersection of $z(s)$ with the $y$-axis is unique and

1615: is also a maximum point of $|f|^2$.  Note that \eqref{E:x} is satisfied when $x=0$.

1616: Also note that \eqref{E:y} reduces to the condition

1617: $[ 4k(y+\eta_0)y^2 - 2y ]\sinh^2y + y^2\sinh2y = 0$

1618: As described by \eqref{E:curvey}, $y=0$ is an extraneous root since $\eta_0>0$, and

1619: \eqref{E:y} is satisfied if and only if

1620: 	\begin{equation}

1621: 	\frac{y}{\beta}=\frac{bQ^2}{2}\left(-\coth y +\frac{1}{y}\right)

1622: 				+Q \left( b\gamma - \frac{h}{2} \right) \equiv g(y).

1623: 	\label{E:yNew}

1624: 	\end{equation}

1625: As was mentioned in \lemRef{L:converge}, any solution $y_0$ must be negative since

1626: $\eta_0>0$.  The function $g(y)$ on the right side of \eqref{E:yNew}

1627: decreases from $bQ^2/2 + Q(b\gamma - h/2)$ to $Q(b\gamma -h/2)<0$  as $y$

1628: increases from $-\infty$ to zero.  Therefore $g(y)$ intersects $h(y)=y/\beta$ at

1629: exactly one point $y_0<0$.

1630: This shows that there is a unique solution $y_0<0$ to

1631: \eqref{E:yNew}.  Consequently, the global maximum of $|f|^2$ on $z(s)$

1632: occurs at $(0,y_z(0))$, the $y$-intercept of the curve $z(s)$.

1633: \end{proof}

1634:

1635: Combining the result of \lemRef{L:saddleRegion} with \lemRef{L:converge}, it is seen

1636: that

1637: 	\begin{equation}

1638: 	F(\beta,a,b,c,\qmin,\qmax) =

1639: 		\lim_{\np\to\infty} \frac{1}{\beta \np}

1640: 			\ln( \mathcal{Z}_{\np,\beta,a,b,c,\qmin,\qmax} )

1641: 		= - \gamma(b\gamma-h)

1642: 		 +\frac{\ln Q}{\beta}

1643: 		 +\frac{1}{\beta}\ln\left[ f(0,y_z(0)) \right],

1644: 	\label{E:cournotPartCalc0}

1645: 	\end{equation}

1646: and the unique global maximum of $|f|$ at $(0,y_z(0))$ on the curve $z(s)$ excludes

1647: any phase transition for this antiferromagnetic model.

1648:

1649: Finally, notice that \lemRef{L:saddleRegion} shows that the only extreme point

1650: of $|f|^2$ on the $y$-axis is at the $y$-intercept of the curve $z(s)$.  This

1651: extreme point must be a saddle point of the holomorphic function $f(z)$, hence

1652: $(0,y_z(0))$ is a global minimum of $v(y)=f(0,y)$.  An explicit formula for the

1653: Cournot free energy can then be written

1654: 	\begin{equation}

1655: 	\begin{aligned}

1656: 	F(\beta,a,b,c,\qmin,\qmax)

1657: 	=&-\frac{1}{4} (\qmax+\qmin)[ b(\qmax+\qmin)- 2(a-c) ]

1658: 	  +\frac{\ln(\qmax-\qmin)}{\beta}

1659: 	\\

1660: 	&+ \frac{1}{\beta}\min_{y\in(-\infty,\infty)}

1661: 		-\beta b y^2

1662: 		+ \ln\left(\sinhc\left[

1663: 				\beta b(\qmax-\qmin)y +

1664: 				\beta\frac{(\qmax-\qmin)}{2}\{b(\qmax+\qmin)-(a-c)\}

1665: 				\right]

1666: 			\right)

1667: 	\end{aligned}

1668: 	\label{E:cournotPartCalc}

1669: 	\end{equation}

1670: where $y$ was translated and rescaled (this does not alter the result since $y$ ranges over all real numbers) to get rid of square roots, and $\sinhc(u)=\sinh(u)/u$ with

1671: $\sinhc(0)=1$.

1672:

1673: Now for a final note on the expected value of the variable $q_i$.  The minimum value in

1674: \eqref{E:cournotPartCalc} occurs when the derivative of the smooth function within the

1675: minimum is zero, which is at $y_m=y_z(0)$.  Since the minimum is a global minimum,

1676: the second derivative with respect to $y$ is nonzero.  By the implicit function

1677: theorem, the minimum point $y_m$ is locally a smooth function of $h=a-c$.

1678: As a result, the partition function in the form \eqref{E:cournotPartCalc0} is seen

1679: to be a smooth function of $h$.

1680:

1681:

1682: The function $F_\np(h)$ in \eqref{E:freeEnergy} is convex in $h$ via Holder's

1683: inequality, and as a result

1684: 	\begin{equation}

1685: 	\frac{F_\np(h)-F_\np(h-\delta)}{\delta}

1686: 	\leq F'_\np(h)

1687: 	\leq \frac{F_\np(h+\delta)-F_\np(h)}{\delta}

1688: 	\label{E:mag}

1689: 	\end{equation}

1690: where $\delta>0$.

1691: Since $F(h)$ is a smooth function of $h$, taking the limit of \eqref{E:mag} as $\np\to\infty$

1692: results in $F'(h)=\lim_{\np\to\infty}F'_\np(h)$.

1693: Using \eqref{E:par3}

1694: 	\begin{equation}

1695: 	F'_{\np}(h) = \gamma - \eta/b - i\brfn{\hat{t}}/(2\beta\sqrt{b}),

1696: 	\label{E:mag2}

1697: 	\end{equation}

1698: where $\brfn{\cdot}$ is a (one-dimensional) Fourier transform of the Gibbs measure for

1699: $\np$ agents,

1700: $\brn{f}=(\int f\,\e^{\beta V})/\mathcal{Z}_\np$, which is generated by \eqref{E:partitionCO2}.

1701: The measure $\brfn{\cdot}$ is generated by the form of the partition function in

1702: \eqref{E:par3}.

1703: Taking the limit of \eqref{E:mag2} as $\np\to\infty$,

1704: 	\begin{equation}

1705: 	F'(h)= h/(2b) + \lim_{\np\to\infty}\brfn{2t/(Q\sqrt{b})}/(2\beta\sqrt{b}).

1706: 	\end{equation}

1707: The latter limit can be evaluated by deforming the integral in the complex plane as before

1708: (change $t$ to $z$), and using \lemRef{L:converge}.

1709: The sequence $\brfn{\cdot}$ of measures converges to a `delta function' or evaluation measure

1710: at $(0,y_m)$.  Such measures are precisely the \emph{characters}, meaning they are homomorphisms

1711: on the `algebra of observables' and as such are extreme points.  This shows that there is

1712: no phase transition in this antiferromagnetic model.

1713: It is worthwhile to elaborate on such technicalities to show that the weak convergence of the

1714: Fourier transformed measures implies that the sequence of measures $\brn{\cdot}$

1715: converges weakly.  With \lemRef{L:gibbsConv}, this implies the Gibbs measures

1716: $\brn{\cdot}$ converge weakly to the infinite-agent measure $\br{\cdot}$.

1717:

1718:

1719:

1720: %

1721: %

1722: %  convergence of Gibbs measure

1723: %

1724: %

1725: %

1726: %

1727: %

1728: \begin{lemma} \label{L:gibbsConv}

1729: The weak convergence of the sequence $\brfn{\cdot}$ of measures

1730:     \footnote{This is the one-dimensional Fourier transform of the Gibbs

1731:     		  measures $\brn{\cdot}$ in the variable $s_\np\equiv(q_1+\cdots+q_\np)/\sqrt{n}$

1732:     		  that is generated by the partition function in \eqref{E:par3}}

1733: implies the weak convergence of the sequence of Gibbs measures $\brn{\cdot}$.

1734: \end{lemma}

1735: \begin{proof}

1736: Let $\mathfrak{A}_\np$ be the \emph{algebra of observables for \np\ agents}, which is simply

1737: $L^\infty([\qmin,\qmax]^\np, dq_1\cdot dq_\np)$.

1738: The algebra of observables (functions) for an infinite number of agents,

1739: $\mathfrak{A}_\infty$, is the inductive limit (von Neumann algebra sense) of the finite-agent

1740: algebras $\mathfrak{A}_\np$.

1741: As a result, the $\mathfrak{A}_\np$ can be considered subalgebras of $\mathfrak{A}_\infty$

1742: and any weak limit of finite-agent measures is defined on $\mathfrak{A}_\infty$.

1743:

1744: Consider two distinct limit points $\br{\cdot},\br{\cdot}'\in\mathfrak{A}_\infty^*$

1745: which are limits of subsequences $\brn[1]{\cdot}$ and $\brn[2]{\cdot}$, respectively.

1746: There is a function $g\in\mathfrak{A}_\infty$ on which $\br{\cdot}$ and $\br{\cdot}'$ differ.

1747: By approximating $g$ closely enough with a function in some $\mathfrak{A}_{n_0}$ we may assume

1748: $g\in\mathfrak{A}_{n_0}$.

1749:

1750: The integral $\brn[i]{g}$ over the $q_j$, $1\leq j\leq n_i$ can be manipulated into an integral

1751: over the variable $\hat{t}$ as was done with $\eqref{E:par3}$, upon which

1752: it becomes an integral $\brfn[i]{G}$, where $G=G(\hat{t})$ is a function independent of $\np_i$.

1753: This shows $\br{g}=\lim_{\np_1}\brfn[1]{G}$ and $\br{g}'=\lim_{\np_2}\brfn[2]{G}$,

1754: which implies the assertion.

1755: \end{proof}

1756:

1757: \begin{remark}

1758: With $\vec{q}=\vq$, $g=g(\vec{q})$, note that \eqref{E:approx} inserts the Fourier transform

1759: $\mathfrak{F}$ (of the Gibbs measure)

1760: into the $\brn{g}$ integral, and \eqref{E:par2} shows what the dual of the

1761: Fourier transform, $\mathfrak{F}^*$ does to $g$.

1762: 	\footnote{In particular, $\mathfrak{F}^*[g](\sqrt{b}t)= \int \exp(i\sqrt{b}t s_\np) 		    E[g(s_\np,\mathbf{s}_\np^\perp)\chi_Q(s,\mathbf{s}_\np^\perp)|s_\np]d s_\np$,

1763: 		    where $\mathbf{s}_\np^\perp$ are integration variables orthogonal

1764: 		    to $s_\np$, $\chi_Q$ is the indicator function on

1765: 		    $\vec{q}\in[-Q/2,Q/2]^\np$, and $E[\cdot|s_\np]$ is the conditional expectation

1766: 		    over the subalgebra of functions in the variables $\mathbf{s}_\np^\perp$.}

1767: The Fourier transform of the Gibbs measures need only converge on the subalgebra of

1768: functions generated by the $\mathfrak{F}^*[\mathfrak{A}_\np]$ in order for the sequence of

1769: Gibbs measures to converge.

1770: \end{remark}

1771:

1772:

1773:

1774:

1775:

1776: The expected value $\br{q_i}$ is then

1777: 	\begin{equation}

1778: 	F'(h) = \frac{h}{2b} + \frac{y_m(\beta,b,h)}{\beta Qb}.

1779: 	\label{E:magnetization}

1780: 	\end{equation}

1781:

1782: From \eqref{E:yNew}, we can see that

1783: 	\begin{equation}

1784: 	\lim_{\beta\to 0} F'(h) = \frac{h}{2b} + \left(\gamma - \frac{h}{2b}\right)

1785: 					= \gamma,

1786: 	\label{E:magInfTemp}

1787: 	\end{equation}

1788: which simply states that for completely irrational behavior (i.e.,`high temperature'),

1789: the agents will act randomly, and the Gibbs-Cournot measure will be uniform.  As such,

1790: the expected value $\br{q_i}$ over the interval $[\qmin,\qmax]$ is the average

1791: $\gamma=(\qmax+\qmin)/2$.

1792:

1793: In the completely rational limit (i.e., `zero temperature'), if $Q/2 + \gamma -h/(2b)>0$

1794: and $\eta\leq0$ (equivalently, if $\gamma\leq h/(2b)<\qmax$),

1795: then the curve $g(y)$ which equals the right side of \eqref{E:yNew} has a positive limit

1796: as $y\to-\infty$.  In this case, the line $h(y)=y/\beta$ intersects $g(y)$ at a point

1797: with $y$-value $y_m$ which has a finite limit as $\beta\to\infty$.  Thus

1798: $y_m/\beta\to 0$ as $\beta\to\infty$ and

1799: 	\begin{equation}

1800: 	\lim_{\beta\to\infty} F'(h) = \frac{h}{2b}

1801: 					\qquad\qquad\qquad\text{for\ }\gamma\leq h/(2b)<\qmax.

1802: 	\label{E:magInfTempHsmall}

1803: 	\end{equation}

1804: In the case when $h/(2b)\geq\qmax$, the curve $g(y)$ is always

1805: negative-valued.  As a result, $y_m\to-\infty$ as $\beta\to\infty$.  In this case,

1806: $g(y)$ can be used to evaluate $y_m/\beta$ and

1807: 	\begin{equation}

1808: 	\lim_{\beta\to\infty} F'(h) = \frac{h}{2b} +

1809: 			\left( \frac{Q}{2} + \gamma -\frac{h}{2b} \right)

1810: 	       = \qmax \qquad\qquad\text{for\ } h/(2b)\geq\qmax.

1811: 	\label{E:magInfTempHlarge}

1812: 	\end{equation}

1813: Since the integral in \eqref{E:par3} is even in $\eta=b\gamma-h/2$,

1814: $\lim_{\beta\to\infty} F'(h)=\qmin$ when $h/(2b)\leq\qmin$.

1815: If $b\gamma-h/2>0$, then $g(y)$ in \eqref{E:yNew} is shifted up and $g(0)>0$.  With this,

1816: $g(y)$ can only intersect the line $y/\beta$ at positive $y_m$.  It is then seen that

1817: $y_m>0$ when $h/(2b)<\gamma$ and $y_m\leq 0$ when $h/(2b)\geq\gamma$.

1818:

1819: In summary, we see that the expected value $\br{q_i}$ in the irrational case is the average

1820: value $(\qmax+\qmin)/2$.  In the completely rational case, we recover the classical

1821: Nash equilibrium value $\br{q_i}=h/(2b)$ when $\qmin\leq h/(2b)\leq\qmax$. When $h/(2b)<\qmin$

1822: ($>\qmax$) then $\br{q_i}$ will be $\qmin$ ($\qmax$).

1823: For finite $\beta$, the equilibrium $\br{q_i}$ will be smaller (larger) than $h/(2b)$ if

1824: $h/(2b)$ itself is larger (smaller) than the average $\gamma=(\qmax+\qmin)/2$.

1825: Furthermore the graphical analysis below \eqref{E:yNew} shows that $|y_m|/\beta$ is a

1826: decreasing function of $\beta$, and $y_m<0(>0)$ when $h/(2b)>\gamma(<\gamma)$.

1827: As a result, the agents deviate farther from the Nash equilibrium $h/(2b)$

1828: and closer to the average $\gamma$ as $\beta$ decreases

1829: (i.e., as deviations from rationality increase).

1830:

1831: The \emph{volatility} (i.e., susceptibility)

1832: 	\begin{equation}

1833: 	\vty=\frac{1}{\beta}F''(h)

1834: 	=\lim_{n\to\infty} \frac{1}{\np} \sum_{i,j=1}^{\np}\br{q_iq_j}-\br{q_i}\br{q_j}

1835: 	\end{equation}

1836: can be evaluated in a similar manner.  Let

1837: 	\begin{equation}

1838: 	G(h,y) = \frac{bQ^2}{2}\left(-\coth y +\frac{1}{y}\right)

1839: 				+Q \left( b\gamma - \frac{h}{2} \right)

1840: 				-\frac{y}{\beta},

1841: 	\end{equation}

1842: which equals zero at points $(h, y_m(h))$.

1843: Then

1844: 	\begin{equation}

1845: 	\frac{1}{\beta}\frac{dy_m}{dh} =

1846: 	\frac{Q}{\beta bQ^2 (\sinh^{-2}y_m - y_m^{-2}) - 2}.

1847: 	\label{E:dydh}

1848: 	\end{equation}

1849: In the irrational limit, $\lim_{\beta\to 0}y_m=0$ and

1850: $\lim_{\beta\to 0} =(\sinh^{-2}y_m - y_m^{-2}) = -1/3$.

1851: Differentiating \eqref{E:magnetization} results in

1852: 	\begin{equation}

1853: 	\vty =

1854: 		\frac{Q^2(\sinh^{-2}y_m - y_m^{-2})/2}{

1855: 		      \beta bQ^2(\sinh^{-2}y_m - y_m^{-2}) - 2 },

1856: 	\label{E:vol}

1857: 	\end{equation}

1858: and as a result

1859: 	\begin{equation}

1860: 	\lim_{\beta\to 0} \vty = \frac{Q^2}{12}.

1861: 	\label{E:volInfTemp}

1862: 	\end{equation}

1863:

1864: In the completely rational limit, either $y_m$ converges to a finite value

1865: (when $(\qmin+\qmax)/2<h/(2b)<\qmax$) or goes to $-\infty$ (when $h/(2b)\geq\qmax$).

1866: If $y_m$ converges to a finite limit, then the denominator of \eqref{E:vol} goes to

1867: $-\infty$ and $\lim_{\beta\to\infty}\vty=0$.  If $y_m$ goes to $-\infty$,

1868: then $\vty\leq(Q^2/4)|\sinh^{-2}y_m - y_m^{-2}|$ and

1869: $\lim_{\beta\to\infty}\vty=0$.  Using that \eqref{E:par3} in even in $\eta$ as before,

1870: 	\begin{equation}

1871: 	\lim_{\beta\to\infty} \vty = 0.

1872: 	\label{E:volZeroTemp}

1873: 	\end{equation}

1874:

1875: In summary, when agents behave randomly in the irrational limit, the volatility is simply the variance of a uniform random variable.  Each agent becomes decorrolated from the other agents and the volatility is the limiting-average of the volatility of each agent.  When agents

1876: are completely rational they all output at the Nash equilibrium values and do not deviate

1877: from that, which is reflected by zero volatility.

1878:

1879:

1880:

1881:

1882:

1883:

1884:

1885:

1886:

1887:

1888:

1889:

1890:

1891:

1892: %

1893: %

1894: %

1895: % BIBLIOGRAPHY

1896: %

1897: %

1898: %

1899: \begin{thebibliography}{10}	%10=widest entry

1900: \bibitem{AGH1}

1901: 	S. P. Anderson, J. K. Goeree, and C. A. Holt:

1902: 	{Stochastic Game Theory: Adjustment to Equilibrium Under Noisy

1903: 			 Directional Learning},

1904: 	under revision for Review of Economic Studies, 1997

1905: \bibitem{AGH2}

1906: 	S. P. Anderson, J. K. Goeree, and C. A. Holt:

1907: 	{The Logit Equilibrium: A Perspective on Intuitive Behavioral Anomalies},

1908: 	Southern Economic Journal, July 2002, {\bf 69(1)}, p.21-47

1909: \bibitem{AUC}

1910: 	H. Al-Wahsh, M. Urb'{a}n, A. Czachor:

1911: 	{Exact solutions for a model antiferromagnet with identical

1912: 	coupling between spins},

1913: 	J. Magn. Mater., 1998, {\bf 185}, p.144-158

1914: \bibitem{A}

1915: 	W. B. Arthur:

1916: 	{Inductive Reasoning and Bounded Rationality},

1917: 	American Economic Review (Papers and Proceedings),1994, {\bf 84}, p.406-411

1918: \bibitem{BS}

1919: 	K. Binmore and L. Samuelson:

1920: 	{Muddling Through: Noisy Equilibrium Selection},

1921: 	J. Econ. Theory, 1997, {\bf 74}, p.235-265

1922: \bibitem{BSV}

1923: 	K. Binmore, L. Samuelson, R. Vaughan

1924: 	{Musical Chairs: Modeling Noisy Evolution},

1925: 	Games and Econ. Behavior, 1995, {\bf 11}, p.1-35

1926: \bibitem{Bl}

1927: 	L. Blume:

1928: 	{Population Games},

1929: 	Working Papers, Santa Fe Institute, 1996, 96-04-022

1930: \bibitem{BlD}

1931: 	L. Blume, S. Durlauf:

1932: 	{Equilibrium Concepts for Social Interaction Models},

1933: 	Working Papers 7, Wisc. Mad. - Soc. Sys., 2002

1934: \bibitem{BCP}

1935: 	E. Burgos, H. Ceva, R.P.J. Perazzo:

1936: 	{Thermal Treatment of the Minority Game},

1937: 	Phys. Rev. E, 2002, {\bf 65}, p.36711

1938: \bibitem{Cannas}

1939: 	Sergio Cannas, private communication.

1940: \bibitem{CGT}

1941: 	S. Cannas, P. Gleiser, F. Tamarit:

1942: 	{Two dimensional Ising model with long-range competing interactions},

1943: 	Recent Research Developments in Physics,

1944: 	(Transworld Research Network, to appear)

1945: \bibitem{Ca}

1946: 	A. Cavagna:

1947: 	{Irrelevance of memory in the minority game},

1948: 	Phys. Rev. E, 1999, {\bf 59}, R3783-R3786

1949: \bibitem{CGGS}

1950: 	A. Cavagna, J. P. Garrahan, I. Giardina, and D. Sherrington:

1951: 	{A thermal model for adaptive competition in a market},

1952: 	Phys. Rev. Lett., 1999, {\bf 83}, p.4429-4432

1953: \bibitem{CZ}

1954: 	D. Challet and Y.-C. Zhang:

1955: 	{Emergence of Cooperation and Organization in an Evolutionary Game},

1956: 	Physica A, 1997, {\bf 246}, p.407

1957: \bibitem{MaCO}

1958: 	D. Challet, M. Marsili, G. Ottino:

1959: 	{Shedding Light on El Farol},

1960: 	submitted to Physica A (2003), preprint cond-mat/0306445. To appear.

1961: \bibitem{C}

1962: 	A. C. C. Coolen:

1963: 	{Non-equilibrium statistical mechanics of Minority Games},

1964: 	for Proceedings of Cergy 2002 Conference

1965: \bibitem{FJL}

1966: 	E. Friedman, S. Johnson, A. S. Landesberg:

1967: 	{The Emergence of Correlations in Studies of Global Economic

1968: 	 Inter-dependence and Contagion},

1969: 	 Quantitative Finance, 2003, {\bf 3}, p.296

1970: \bibitem{FL}

1971: 	E. Friedman and A. S. Landesberg:

1972: 	{Large-Scale Synchrony in Weakly Interacting Automata},

1973: 	Phys. Rev. E, 2001, {\bf 63}, p.051303

1974: \bibitem{HSa1}

1975: 	J. Hofbauer and W. Sandholm:

1976: 	{On the Global Convergence of Stochastic Fictitious Play},

1977: 	Econometrica, 2002, {\bf 70}, p.2265-2294

1978: \bibitem{HSa2}

1979: 	J. Hofbauer and W. Sandholm:

1980: 	{Evolution in Games with Randomly Disturbed Payoffs},

1981: 	preprint

1982: \bibitem{K}

1983: 	M. Kac in: M. Chretien, E.P. Gross, S. Deser (editors)

1984: 	{Statistical Physics, Phase Transitions, and Superfluidity, vol. 1},

1985: 	Gordon and Breach, New York, 1968, p. 241

1986: \bibitem{LVS}

1987: 	Y. Li, A. VanDeemen, R. Savit:

1988: 	{The Minority Game with Variable Payoffs},

1989: 	http://arxiv.org/abs/nlin/0002004

1990: \bibitem{L}

1991: 	D. Luce:

1992: 	{Individual Choice Behavior},

1993: 	New York, Wesley, 1959

1994: \bibitem{Ma}

1995: 	M. Marsili:

1996: 	{On the multinomial Logit model},

1997: 	Physica A, 1999, {\bf 269}, p.9

1998: \bibitem{MaC}

1999: 	M. Marsili and D. Challet:

2000: 	{Trading Behavior and Excess Volatility in Toy Markets},

2001: 	Adv. Complex Systems, 2001, {\bf 3} p.1-14

2002: \bibitem{MaC2}

2003: 	M. Marsili and D. Challet:

2004: 	{Phase Transition and Symmetry Breaking in the Minority Game},

2005: 	Phys. Rev. E, 1999, {\bf 60}, R6271

2006: \bibitem{MaCZ}

2007: 	M. Marsili, D. Challet, and Y.-C. Zhang:

2008: 	{Exact solution of a modified El Farol's bar problem:

2009: 		Efficiency and the role of market impact},

2010: 	Physica A, 2000, {\bf 280}, p.522

2011: \bibitem{MaZ}

2012: 	M. Marsili and Y.-C. Zhang:

2013: 	{Stochastic Dynamics in Game Theory},

2014: 	Proceedings of the Budapest conference ECONOPHYSICS, Kluwer, 1998

2015: \bibitem{MaZ2}

2016: 	M. Marsili and Y.-C. Zhang:

2017: 	{Fluctuations around Nash Equilibria in Game Theory},

2018: 	Physica A, 1997, {\bf 245}, p.181

2019: \bibitem{MI}

2020: 	A.B. MacIsaac, J.P. Whitehead, M.C. Robinson, K. De'Bell:

2021: 	{Striped phases in two-dimensional dipolar ferromagnets}

2022: 	Phys. Rev. B, 1995, {\bf 51}, p.16033.

2023: \bibitem{McF}

2024: 	D. McFadden:

2025: 	{Conditional Logit Analysis of Qualitative Choice Behavior},

2026: 	Frontiers of Econometrics (P. Zarembka Ed.), New York, Academic Press, 1973

2027: \bibitem{M}

2028: 	E. Milotti:

2029: 	{Exactly solved dynamics for an infinite-range spin system.

2030: 	II. Antiferromagnetic interaction},

2031: 	Phys. Rev. E, 2002, {\bf 65}, p.027102

2032: \bibitem{MM}

2033: 	Y. Mu and Y. Ma:

2034: 	{Self-organizing stripe patterns in two-dimensional frustrated

2035: 	 systems with competing interactions}

2036: 	Phys. Rev. B, 2003, {\bf 67}, p.014110 (6 pages)

2037: \bibitem{MP}

2038: 	R. D. McKelvey and T. R. Palfrey:

2039: 	{Quantal Response Equilibria for Normal Form Games},

2040: 	Games and Economic Behavior, 1995, {\bf 10}, p.6-38

2041: \bibitem{MS}

2042: 	D. Monderer and L. S. Shapley:

2043: 	{Potential Games},

2044: 	Games and Economic Behavior, 1996, {\bf 14}, p.124-143

2045: \bibitem{N}

2046: 	J. F. Nagle:

2047: 	{Ising Chain with Competing Interactions}

2048: 	Phys. Rev. A, 1970, {\bf 2} no.5, p.2124-2128

2049: \bibitem{PSBS}

2050: 	H. V. D. Paranuk, R. Savit, S. A. Brueckner, J. Sauter:

2051: 	{A Technical Overview of the AORIST Project},

2052: 	preprint

2053: \bibitem{PGGS}

2054: 	V. Plerou, P. Gopikrishnan, X. Gabaix, H. Stanley:

2055: 	{Quantifying Stock Price Response to Demand Fluctuations},

2056: 	Phys. Rev. E, 2002, {\bf 66} p.027104-1--027104-4

2057: \bibitem{R}

2058: 	F. Reif

2059: 	{Fundamentals of Statistical and Thermal Physics},

2060: 	McGraw-Hill, Inc., 1965

2061: \bibitem{Sa1}

2062: 	W. Sandholm:

2063: 	{Potential Games with Continuous Player Sets},

2064: 	J. Econ. Theory, 2001, {\bf 97} p.81-108

2065: \bibitem{Sa2}

2066: 	W. Sandholm:

2067: 	{Excess Payoff Dynamics, Potential Dynamics, and Stable Games},

2068: 	Working Papers 5, Wisc. Mad. - Soc. Sys., 2003

2069: \bibitem{Si}

2070: 	B. Simon

2071: 	{The statistical mechanics of lattice gases, vol. 1},

2072: 	Princeton University Press, Princeton, N.J., 1993

2073: \bibitem{So}

2074: 	A.D. Sokal:

2075: 	{Monte Carlo Methods in Statistical Mechanics: Foundations and New Algorithms},

2076: 	Lectures at the Cargese Summer School on Functional Integration: Basics and Applications, 1996

2077: \bibitem{VG}

2078: 	A. P. Vieira and L. L. Gon\c{c}alves:

2079: 	{One-dimensional Ising model with long-range and random short-range interactions},

2080: 	J. Magn. and Magn. Mater., 1999, {\bf 192}, p.177-190

2081: \end{thebibliography}

2082:

2083:

2084:

2085:

2086: \end{document}