\begin{abstract}
In the original version of the Kolkata Paise Restaurant (KPR) problem,
each of the $N$ agents (or players) independently chooses one of the
$N$ restaurants every day, updating his/her strategy based on past
experience of failures. An agent gets the only food plate served at the
chosen restaurant if he/she is alone there, or is lucky enough to be
picked randomly from the crowd that arrived at that restaurant that day.
The objective of the agents is to learn by themselves, in the minimum
(learning) time, to achieve the maximum success or utilization
probability ($f$). A dictator can easily solve the problem with $f = 1$
in no time, by asking everyone to form a queue and go to the respective
restaurant, resulting in no fluctuation and full utilization from the
first day (convergence time $\tau = 0$). It has already been shown that
if each agent chooses a restaurant at random, $f = 1 - e^{-1} \simeq 0.63$
and $\tau = 0$, while the crowd avoiding (CA) strategy (determined by
yesterday's crowd size at the chosen restaurant) gives $f \simeq 0.80$
in a finite convergence time $\tau$ (of order ten). Many numerical
studies of modified learning strategies have indicated an increased
value of $f = 1 - \alpha$ for $\alpha \to 0$, with $\tau \sim 1/\alpha$.
We show here, using the Monte Carlo technique, that a modified Greedy
Crowd Avoiding (GCA) strategy can assure full utilization ($f = 1$) in
convergence time $\tau = e N$, where $e$ denotes Euler's number. This
observation perhaps suggests that, using non-dictated strategies for
KPR, full utilization can never be collectively learned or achieved in
finite convergence time when $N$, the number of customers or of
restaurants, goes to infinity.
\end{abstract}
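
The random-choice value $f = 1 - e^{-1}$ quoted above can be recovered by a short counting argument. The sketch below assumes that each agent picks one of the $N$ restaurants uniformly and independently of the other agents and of past days; the symbol $P_0$ for the probability that a given restaurant receives no agent is introduced here only for illustration. Since each restaurant serves exactly one plate, a restaurant is utilized whenever at least one agent arrives, so
\begin{equation}
P_0 = \left(1 - \frac{1}{N}\right)^{N} \xrightarrow[N \to \infty]{} e^{-1},
\qquad
f = 1 - P_0 \xrightarrow[N \to \infty]{} 1 - e^{-1} \simeq 0.63 .
\end{equation}
Because this strategy uses no memory, the same value holds from the first day, consistent with $\tau = 0$.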