
\begin{abstract}


In multi-agent reinforcement learning systems (MARLS), formulating a problem generally requires substantial reward engineering effort that is specific to that problem.

However, this effort often does not transfer to other problems; worse, it is wasted when the system dynamics change drastically.

The problem is exacerbated in sparse-reward scenarios, where a meaningful heuristic can help the policy converge.

We propose \textbf{GOV}erned \textbf{R}eward \textbf{E}ngineering \textbf{K}ernels (GOV-REK), which dynamically assign reward distributions to the agents in a MARLS during their learning stage.

We also introduce governance kernels, which exploit the underlying structure of either the state space or the joint action space to assign meaningful reward distributions to agents.

During the agent learning stage, GOV-REK iteratively explores different reward distribution configurations with a Hyperband-like algorithm to learn ideal agent reward models in a problem-agnostic manner.

Our experiments demonstrate that these meaningful reward priors robustly jumpstart the learning process across different MARL problems.

\\

\\

\textbf{\textit{Keywords: }}Cooperative Multi-Agent Systems, Sparse Reinforcement Learning, Robust Multi-Agent Systems, Reward Shaping


\end{abstract}
