abstract:184782c4b62e8bcf.tex

1: \begin{abstract}

2: Tracking objects in 3D space and predicting their \acs{6DoF} pose is an essential task in computer vision.

3: State-of-the-art approaches often rely on object texture to tackle this problem.

4: However, while these approaches achieve impressive results, many objects do not contain sufficient texture, which violates their main underlying assumption.

5: We therefore propose \textit{ICG}, a novel probabilistic tracker that fuses region and depth information and requires only the object geometry.

6: Our method deploys correspondence lines and points to iteratively refine the pose.

7: We also implement robust occlusion handling to improve performance in real-world settings.

8: Experiments on the \textit{YCB-Video}, \textit{OPT}, and Choi datasets demonstrate that, even for textured objects, our approach outperforms the current state of the art with respect to accuracy and robustness.

9: At the same time, \textit{ICG} shows fast convergence and outstanding efficiency, requiring only $1.3\,\unit{ms}$ per frame on a single CPU core.

10: Finally, we analyze the influence of individual components and discuss how the performance of our tracker compares to that of deep learning-based methods.

11: The source code of our tracker is publicly available\footnote{\url{https://github.com/DLR-RM/3DObjectTracking}}.

12: \end{abstract}