abstract:6fcdb26883d21a98.tex

1: \begin{abstract}

2: Sparse Tucker Decomposition (STD) algorithms learn a core tensor and a group of factor matrices to obtain an optimal low-rank representation feature for the

3: \underline{H}igh-\underline{O}rder, \underline{H}igh-\underline{D}imension, and \underline{S}parse \underline{T}ensor (HOHDST).

4: However,

5: existing STD algorithms face the problem of intermediate variables explosion which results from the fact that the formation of those variables, i.e., matrices Khatri-Rao product, Kronecker product, and

6: matrix-matrix multiplication, follows the whole elements in sparse tensor.

7: The above problems prevent deep fusion of efficient computation and big data platforms.

8: To overcome the bottleneck,

9: a novel stochastic optimization strategy (SGD$\_$Tucker) is proposed for STD

10: which can automatically divide the high-dimension intermediate variables

11: into small batches of intermediate matrices.

12: Specifically, SGD$\_$Tucker only follows the randomly selected small samples rather than the whole elements, while maintaining the overall accuracy and convergence rate.

13: In practice,

14: SGD$\_$Tucker features the two distinct advancements over the state of the art.

15: First, SGD$\_$Tucker can prune the communication overhead for the core tensor in distributed settings.

16: Second, the low data-dependence of SGD$\_$Tucker enables fine-grained parallelization,

17: which makes SGD$\_$Tucker obtaining lower computational overheads with the same accuracy.

18: Experimental results show that SGD$\_$Tucker runs at least 2$X$ faster than the state of the art.

19: \renewcommand{\raggedright}{\leftskip=0pt \rightskip=0pt plus 0cm}

20: \raggedright

21: \end{abstract}

22: