1: \begin{abstract}
2: \sloppy{We consider Internet-based Master-Worker task computing systems, such as
3: SETI@home, where a master sends tasks to potentially unreliable workers, and
4: the workers execute and report back the result.} We model such computations using
5: evolutionary dynamics and consider three types of workers: {\em altruistic},
6: {\em malicious} and {\em rational}. Altruistic workers always compute and return
7: the correct result, malicious workers always return an incorrect result, and
8: rational (selfish) workers decide whether to be truthful or to cheat, depending on which strategy
9: increases their benefit. The goal of the master is to reach eventual correctness,
10: that is, to reach a state of the computation from which the master always receives correct results.
11: To this end, we propose a mechanism that uses {\em reinforcement learning} to induce
12: correct behavior in rational workers; to cope with malice we employ {\em reputation
13: schemes}.
14: \ec{We analyze our reputation-based mechanism by modeling it as a Markov chain
15: and identify conditions under which truthful behavior is provably ensured.
16: Simulation results, obtained using parameter values that are likely to occur in
17: practice, reveal interesting trade-offs between various metrics, parameters and reputation types,
18: affecting cost, convergence time to truthful behavior, and tolerance to cheaters.
19: \vspace{-1em}}
20: \end{abstract}
21: