abstract:12eacc73bcb34811.tex

1: \begin{abstract}

2:

3: This paper proposes a new optimizer for deep learning, named d-AmsGrad.

4: In the real-world data, noise and outliers cannot be excluded from dataset to be used for learning robot skills.

5: This problem is especially striking for robots that learn by collecting data in real time, which cannot be sorted manually.

6: Several noise-robust optimizers have therefore been developed to resolve this problem, and one of them, named AmsGrad, which is a variant of Adam optimizer, has a proof of its convergence.

7: However, in practice, it does not improve learning performance in robotics scenarios.

8: This reason is hypothesized that most of robot learning problems are non-stationary, but AmsGrad assumes the maximum second momentum during learning to be stationarily given.

9: In order to adapt to the non-stationary problems, an improved version, which slowly decays the maximum second momentum, is proposed.

10: The proposed optimizer has the same capability of reaching the global optimum as baselines, and its performance outperformed that of the baselines in robotics problems.

11:

12: \end{abstract}