1: \begin{abstract}
2: We study a framework where agents have to avoid aversive signals.
3: The agents are given only partial information, in the form of features that are projections of task states.
4: Additionally, the agents have to cope with non-determinism, defined as unpredictability on the way that actions are executed.
5: The goal of each agent is to define its behavior based on feature-action pairs that reliably avoid aversive signals.
6: We study a learning algorithm, called \aLearn, that exhibits fixpoint convergence, where the belief of the allowed feature-action pairs eventually becomes fixed.
7: \aLearn\ is parameter-free and easy to implement.
8: \end{abstract}
9: