1: \begin{abstract}
2:
3:
4: For multi-agent reinforcement learning systems (MARLS), the problem formulation generally involves investing massive reward engineering effort specific to a given problem.
5: However, this effort often cannot be translated to other problems; worse, it gets wasted when system dynamics change drastically.
6: This problem is further exacerbated in sparse reward scenarios, where a meaningful heuristic can assist in the policy convergence task.
7: We propose \textbf{GOV}erned \textbf{R}eward \textbf{E}ngineering \textbf{K}ernels (GOV-REK), which dynamically assign reward distributions to agents in MARLS during its learning stage.
8: We also introduce governance kernels, which exploit the underlying structure in either state or joint action space for assigning meaningful agent reward distributions.
9: During the agent learning stage, it iteratively explores different reward distribution configurations with a Hyperband-like algorithm to learn ideal agent reward models in a problem-agnostic manner.
10: Our experiments demonstrate that our meaningful reward priors robustly jumpstart the learning process for effectively learning different MARL problems.
11: \\
12: \\
13: \textbf{\textit{Keywords: }}Cooperative Multi-Agent Systems, Sparse Reinforcement Learning, Robust Multi-Agent Systems, Reward Shaping
14:
15: \end{abstract}
16: