ad3fffe0c6663bb6.tex
1: \begin{abstract}
2: Creating incentives for cooperation is a challenge in natural and artificial systems.  
3: One potential answer is reputation, whereby agents trade the immediate cost of cooperation for the future benefits of having a good reputation. 
4: Game theoretical models have shown that specific social norms can make cooperation stable, but how agents  can independently learn to establish effective reputation mechanisms on their own is less understood. 
5: We use a simple model of reinforcement learning to show that reputation mechanisms generate two coordination problems: agents need to learn how to coordinate on the meaning of existing reputations and collectively agree on a social norm to assign reputations to others based on their behavior. 
6: These coordination problems exhibit multiple equilibria, some of which effectively establish cooperation. 
7: When we train agents with a standard Q-learning algorithm in an environment with the presence of reputation mechanisms, convergence to undesirable equilibria is widespread. 
8: We propose two mechanisms to alleviate this: \emph{(i)} seeding a proportion of the system with fixed agents that steer others towards good equilibria; and \emph{(ii)}, intrinsic rewards based on the idea of introspection, i.e., augmenting agents' rewards by an amount proportionate to the performance of their own strategy against themselves. 
9: A combination of these simple mechanisms is successful in stabilizing cooperation, even in a fully decentralized version of the problem where agents learn to use and assign reputations simultaneously. 
10: We show how our results relate to the literature in Evolutionary Game Theory, and  discuss implications for artificial, human and hybrid systems, where reputations can be used as a way to establish trust and cooperation.
11: \end{abstract}
12: