abstract:687781455342a21b.tex

1: \begin{abstract}

2: We propose a deep reinforcement learning (DRL) methodology for the tracking, obstacle avoidance, and formation control of nonholonomic robots.

3: By separating vision-based control into a perception module and a controller module, we can train a DRL agent without sophisticated physics or 3D modeling.

4: In addition, the modular framework averts daunting retrains of an image-to-action end-to-end neural network, and provides flexibility in transferring the controller to different robots.

5: First, we train a convolutional neural network (CNN) to accurately localize in an indoor setting with dynamic foreground/background.

6: Then, we design a new DRL algorithm named Momentum Policy Gradient (MPG) for continuous control tasks and prove its convergence.

7: We also show that MPG is robust at tracking varying leader movements and can naturally be extended to problems of formation control.

8: Leveraging reward shaping, features such as collision and obstacle avoidance can be easily integrated into a DRL controller.

9:

10: \end{abstract}

11: