e820e5dba61b691c.tex
1: \begin{abstract}
2: This paper handles a kind of strategic game called potential games
3: and develops a novel learning algorithm 
4: Payoff-based Inhomogeneous Partially Irrational Play (PIPIP).
5: The present algorithm is based on Distributed Inhomogeneous 
6: Synchronous Learning (DISL) presented in an existing work but, unlike DISL,
7: PIPIP allows agents to make irrational decisions with a specified probability, 
8: i.e. agents can choose an action with a low utility from the past actions 
9: stored in the memory.
10: Due to the irrational decisions,
11: we can prove convergence in probability
12: of collective actions to potential function maximizers. 
13: Finally, we demonstrate the effectiveness of the present algorithm
14: through experiments on a sensor coverage problem.
15: It is revealed through the demonstration that
16: the present learning algorithm successfully leads
17: agents to around potential function maximizers
18: even in the presence of undesirable Nash equilibria.
19: We also see through the experiment
20: with a moving density function that PIPIP has adaptability
21: to environmental changes.
22: \end{abstract}