abstract:e3a071d9cfd593b0.tex

1: \begin{abstract}

2: % Definition of TNN

3: We propose \textit{\tensornet ({\tensornetshort}s)}, a generalization of existing neural networks by extending tensor operations on low order operands to those on high order ones.

4: % Challenge of TNN

5: The problem of parameter learning is challenging, as it corresponds to hierarchical nonlinear tensor decomposition.

6: % Solution to the problem

7: We propose to solve the learning problem using stochastic gradient descent by deriving nontrivial backpropagation rules in generalized tensor algebra we introduce.

8: % Benefits of TNN

9:  Our proposed {\tensornetshort}s has three advantages over existing neural networks:

10:  (1) {\tensornetshort}s naturally apply to high order input object and thus preserve the multi-dimensional structure in the input, as there is no need to flatten the data.

11:  (2) {\tensornetshort}s interpret designs of existing neural network architectures.

12:  (3) Mapping a neural network to {\tensornetshort}s with the same expressive power results in a {\tensornetshort} of fewer parameters.

13:  % Compression algorithms

14:  \ouralgo of neural network improves existing \ourbaseline methods as {\tensornetshort}s exploit two other types of {\em invariant structures}, periodicity and modulation, in addition to the low rankness.

15:  % Experimental results

16: Experiments on LeNet-5 (MNIST), ResNet-32 (CIFAR10) and ResNet-50 (ImageNet) demonstrate that our \ouralgo outperforms (5\% test accuracy improvement universally on CIFAR10) the state-of-the-art \ourbaseline methods under the same compression rate, besides achieving orders of magnitude faster convergence rates due to the efficiency of {\tensornetshort}s.

17: \end{abstract}

18: