1: \begin{abstract}
2:
3: Learning 3D point sets with rotational invariance is an important and challenging problem in machine learning. Through rotational invariant architectures, 3D point cloud neural networks are relieved from requiring a canonical global pose and from exhaustive data augmentation with all possible rotations.
4: %
5: In this work we introduce a rotational invariant neural network by combining recently introduced vector neurons with self attention layers to build a point cloud vector neuron transformer network (VNT-Net).
6: %
7: Vector neurons are known for their simplicity and versatility in representing SO(3) actions and thereby incorporated in common neural operations.
8: %
9: Similarly, Transformer architectures have gained popularity and recently were showed successful for images by applying directly on sequences of image patches and achieving superior performance and convergence.
10: % VIT took mach more resources then convolutional by alot but in point cloud the Transformer with allot of changes could be comparable to convolutional computational resources.
11: In order to benefit from both worlds we combine the two structures by mainly showing how to adapt the multi-headed attention layers to comply with vector neurons operations.
12: %
13: Through this adaptation attention layers become SO(3) and the overall network becomes rotational invariant.
14: %
15: Experiments demonstrate that our network efficiently handles 3D point cloud objects in arbitrary poses. We also show that our network achieves higher accuracy when compared to related state-of-the-art methods and requires less training due to a smaller number of hyper parameters in common classification and segmentation tasks .
16:
17: \end{abstract}
18: