abstract:f705e5c9ceff37e0.tex

1: \begin{abstract}

2:

3: Learning 3D point sets with rotational invariance is an important and challenging problem in machine learning. Through rotational invariant architectures, 3D point cloud neural networks are relieved from requiring a canonical global pose and from exhaustive data augmentation with all possible rotations.

4: %

5: In this work we introduce a rotational invariant neural network by combining recently introduced vector neurons with self attention layers to build a point cloud vector neuron transformer network (VNT-Net).

6: %

7: Vector neurons are known for their simplicity and versatility in representing SO(3) actions and thereby incorporated in common neural operations.

8: %

9: Similarly, Transformer architectures have gained popularity and recently were showed successful for images by applying directly on sequences of image patches and achieving superior performance and convergence.

10: % VIT took mach more resources then convolutional by alot but in point cloud the Transformer with allot of changes could be comparable to convolutional computational resources.

11: In order to benefit from both worlds we combine the two structures by mainly showing how to adapt the multi-headed attention layers to comply with vector neurons operations.

12: %

13: Through this adaptation attention layers become SO(3) and the overall network becomes rotational invariant.

14: %

15: Experiments demonstrate that our network efficiently handles 3D point cloud objects in arbitrary poses. We also show that our network achieves higher accuracy when compared to related state-of-the-art methods and requires less training due to a smaller number of hyper parameters in common classification and segmentation tasks .

16:

17: \end{abstract}

18: