abstract:517e6552fa4fca8b.tex

1: \begin{abstract}

2: In online continual learning, a neural network incrementally learns from a non-i.i.d. data stream.

3: % without training to convergence before observing new data.

4: % and networks are not trained to convergence before observing new training data.

5: Nearly all online continual learning methods employ experience replay to simultaneously prevent catastrophic forgetting and underfitting on past data.

6: % maximize the approximate likelihood over all observed data.

7: Our work demonstrates a limitation of this approach:

8: networks trained with experience replay tend to have unstable optimization trajectories, impeding their overall accuracy.

9: Surprisingly, these instabilities persist even when the replay buffer stores all previous training examples,

10: suggesting that this issue is orthogonal to catastrophic forgetting.

11: % However, less attention is placed on how to optimize this approximate likelihood such that the optimization process is stable.

12: We minimize these instabilities through a simple modification of the optimization geometry.

13: % In this work, we highlight the importance of experience replay objective optimization on the model's predictive performance and introduce Layerwise Proximal Replay (LPR) as a possible solution.

14: Our solution, Layerwise Proximal Replay (LPR), balances learning from new and replay data while only allowing for gradual changes in the hidden activation of past data.

15: We demonstrate that LPR consistently improves replay-based online continual learning methods across multiple problem settings, regardless of the amount of available replay memory.

16: \end{abstract}

17: