\begin{abstract}
In the original version of the Kolkata Paise Restaurant (KPR)
problem, each of the $N$ agents (or players) independently chooses
one of the $N$ restaurants every day, updating his/her strategy
based on past experience of failures. An agent gets the only food
plate served at the chosen restaurant if he/she is alone there, or
is lucky enough to be picked randomly from the crowd that arrived
at that restaurant that day. The objective of the agents is to
learn, by themselves and in the minimum (learning) time, to achieve
the maximum success or utilization probability ($f$). A dictator
can easily solve the problem with $f = 1$ in no time, by asking
everyone to form a queue and go to their respective restaurants,
resulting in no fluctuation and full utilization from the first day
(convergence time $\tau = 0$). It has already been shown that if
each agent chooses a restaurant at random, then
$f = 1 - e^{-1} \simeq 0.63$ and $\tau = 0$, while the crowd
avoiding (CA) strategy (determined by yesterday's crowd size at the
chosen restaurant) gives $f \simeq 0.80$ in a finite (of order ten)
convergence time $\tau$. Many numerical studies of modified
learning strategies have indicated increased values of
$f = 1 - \alpha$ for $\alpha \to 0$, with $\tau \sim 1/\alpha$.
We show here, using the Monte Carlo technique, that a modified
Greedy Crowd Avoiding (GCA) strategy can assure full utilization
($f = 1$) in convergence time $\tau = e N$, where $e$ denotes
Euler's number. This observation perhaps suggests that, with
non-dictated strategies for KPR, full utilization can never be
collectively learned or achieved in finite convergence time when
$N$, the number of customers or of restaurants, goes to infinity.
\end{abstract}
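
As a brief check of the random-choice value quoted in the abstract,
the following sketch assumes that each of the $N$ agents picks one of
the $N$ restaurants uniformly at random and independently of the
others (this reading of ``chooses randomly'' is an assumption, not
stated in the abstract). Under that assumption, a given restaurant is
visited by no agent with probability $(1 - 1/N)^{N}$, and the
utilization is the expected fraction of restaurants visited by at
least one agent:
\[
  f \;=\; 1 - \left(1 - \frac{1}{N}\right)^{N}
  \;\longrightarrow\; 1 - e^{-1} \simeq 0.63
  \qquad (N \to \infty).
\]
Since no learning is involved in this case, the value holds from the
first day, which is consistent with $\tau = 0$.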