\begin{abstract}
This paper presents an accelerated path integral method that iteratively searches for optimal controls in a small number of iterations.
The study builds on the recent observation that a path integral method for reinforcement learning can be interpreted as gradient descent.
This observation also applies to an iterative path integral method for optimal control, which motivates the use of optimization methods developed for gradient descent, such as momentum-based acceleration, step-size adaptation, and their combination.
We introduce these methods into the path integral framework and demonstrate that momentum-based methods, such as Nesterov Accelerated Gradient and Adam, can significantly improve the convergence rate of the search for optimal controls in simulated control systems.
We also demonstrate that the accelerated path integral method can improve the performance of model predictive control in various vehicle navigation tasks.
Finally, we represent the accelerated path integral method as a recurrent network, an accelerated version of the previously proposed path integral network (PI-Net). The accelerated PI-Net can be trained for inverse optimal control more efficiently and with less RAM than the original PI-Net.
\end{abstract}