1: \begin{abstract}
2: It is challenging to control a soft robot, where reinforcement learning methods have been applied with promising results.
3: However, due to the poor sample efficiency, reinforcement learning methods require a large collection of training data, which limits their applications.
4: In this paper, we propose a Q-learning controller for a physical soft robot, in which pre-trained models using data from a rough simulator are applied to improve the performance of the controller.
5: We implement the method on our soft robot, i.e., Honeycomb Pneumatic Network (HPN) arm.
6: The experiments show that the usage of pre-trained models can not only reduce the amount of the real-world training data, but also greatly improve its accuracy and convergence rate.
7: \end{abstract}
8: