abstract:e820e5dba61b691c.tex

1: \begin{abstract}

2: This paper handles a kind of strategic game called potential games

3: and develops a novel learning algorithm

4: Payoff-based Inhomogeneous Partially Irrational Play (PIPIP).

5: The present algorithm is based on Distributed Inhomogeneous

6: Synchronous Learning (DISL) presented in an existing work but, unlike DISL,

7: PIPIP allows agents to make irrational decisions with a specified probability,

8: i.e. agents can choose an action with a low utility from the past actions

9: stored in the memory.

10: Due to the irrational decisions,

11: we can prove convergence in probability

12: of collective actions to potential function maximizers.

13: Finally, we demonstrate the effectiveness of the present algorithm

14: through experiments on a sensor coverage problem.

15: It is revealed through the demonstration that

16: the present learning algorithm successfully leads

17: agents to around potential function maximizers

18: even in the presence of undesirable Nash equilibria.

19: We also see through the experiment

20: with a moving density function that PIPIP has adaptability

21: to environmental changes.

22: \end{abstract}