1: \begin{abstract}
2: We formulate computation offloading as a decentralized decision-making problem with autonomous agents. We design an interaction mechanism that incentivizes agents to align private and system goals by balancing competition and cooperation. In the static case, the mechanism provably has Nash equilibria with optimal resource allocation. For a dynamic environment, we propose a novel multi-agent online learning algorithm that learns from partial, delayed, and noisy state information, together with a reward signal that greatly reduces the amount of information needed. Empirical results confirm that, through learning, agents significantly improve both system and individual performance, e.g., a $40\%$ reduction in offloading failure rate, a $32\%$ reduction in communication overhead, up to $38\%$ savings in computation resources under low contention, an $18\%$ increase in utilization with reduced load variation under high contention, and improved fairness. The results also confirm that the algorithm converges well and generalizes to significantly different environments.
3: %We propose a novel multi-agent online learning algorithm for decentralized computation offloading decision-making in vehicular network, with partial, delayed and noisy state information. We design an interaction mechanism based on auction, which incentivizes both competition and cooperation, and provably has Nash equilibria with optimal resource allocation. Empirical result confirms that through learning, the system achieves $20$-$38\%$ savings on computation resource in low contention, up to $10\%$ increase in offloading success rate, up to $18\%$ resource utilization increase and on average $9\%$ less load variation in high contention, $32\%$ communication overhead reduction, as well as improvement in fairness. The learned models are easily generalizable to other settings.
4: \end{abstract}
5: