\begin{abstract}
We study a framework in which agents have to avoid aversive signals.
The agents are given only partial information, in the form of features that are projections of task states.
Additionally, the agents have to cope with non-determinism, defined as unpredictability in the way that actions are executed.
The goal of each agent is to base its behavior on feature-action pairs that reliably avoid aversive signals.
We study a learning algorithm, called \aLearn, that exhibits fixpoint convergence: the agent's belief about the allowed feature-action pairs eventually becomes fixed.
\aLearn\ is parameter-free and easy to implement.
\end{abstract}
