1: \begin{abstract}
2: We present a self-supervised approach to estimate flow in camera image and top-view grid map sequences using fully convolutional neural networks in the domain of automated driving.
3: We extend existing approaches for self-supervised optical flow estimation by adding a regularizer expressing motion consistency assuming a static environment.
4: However, as this assumption is violated for other moving traffic participants we also estimate a mask to scale this regularization.
5: Adding a regularization towards motion consistency improves convergence and flow estimation accuracy.
6: Furthermore, we scale the errors due to spatial flow inconsistency by a mask that we derive from the motion mask.
7: This improves accuracy in regions where the flow drastically changes due to a better separation between static and dynamic environment.
8: We apply our approach to optical flow estimation from camera image sequences, validate on odometry estimation and suggest a method to iteratively increase optical flow estimation accuracy using the generated motion masks.
9: Finally, we provide quantitative and qualitative results based on the KITTI odometry and tracking benchmark for scene flow estimation based on grid map sequences.
10: We show that we can improve accuracy and convergence when applying motion and spatial consistency regularization.
11: \end{abstract}
12: