\begin{abstract}
Tracking objects in 3D space and predicting their \acs{6DoF} pose is an essential task in computer vision.
State-of-the-art approaches often rely on object texture to tackle this problem.
However, while these approaches achieve impressive results, many objects do not contain sufficient texture, which violates their main underlying assumption.
We therefore propose \textit{ICG}, a novel probabilistic tracker that fuses region and depth information and requires only the object geometry.
Our method uses correspondence lines and points to iteratively refine the pose.
We also implement robust occlusion handling to improve performance in real-world settings.
Experiments on the \textit{YCB-Video}, \textit{OPT}, and Choi datasets demonstrate that, even for textured objects, our approach outperforms the current state of the art with respect to accuracy and robustness.
At the same time, \textit{ICG} shows fast convergence and outstanding efficiency, requiring only $1.3\,\unit{ms}$ per frame on a single CPU core.
Finally, we analyze the influence of individual components and discuss our performance in comparison with deep learning-based methods.
The source code of our tracker is publicly available\footnote{\url{https://github.com/DLR-RM/3DObjectTracking}}.
\end{abstract}