abstract:14c721c7278003b3.tex

1: \begin{abstract}

2: When has an agent converged?

3: %

4: %

5: Standard models of the reinforcement learning problem give rise to a straightforward definition of convergence: An agent converges when its behavior or performance in each environment state stops changing.

6: %

7: %

8: However, as we shift the focus of our learning problem from the environment's state to the agent's state, the concept of an agent's convergence becomes significantly less clear.

9: %

10: %

11: % This paper.

12: In this paper, we propose two complementary accounts of agent convergence in a framing of the reinforcement learning problem that centers around bounded agents.

13: %

14: %

15: % First: performance.

16: The first view says that a bounded agent has converged when the minimal number of states needed to describe the agent's future behavior cannot decrease.

17: %

18: %

19: % Second: behavior.

20: The second view says that a bounded agent has converged just when the agent's performance only changes if the agent's internal state changes.

21: %

22: %

23: % Relationship of two proposals.

24: We establish basic properties of these two definitions, show that they accommodate typical views of convergence in standard settings, and prove several facts about their nature and relationship.

25: %

26: %

27: % Impact.

28: We take these perspectives, definitions, and analysis to bring clarity to a central idea of the field.

29: \end{abstract}

30: