\begin{abstract}
  Tensors have found application in a variety of fields, ranging from
  chemometrics to signal processing and beyond. In this paper, we
  consider the problem of multilinear modeling of \emph{sparse count}
  data. Our goal is to develop a descriptive tensor factorization
  model of such data, along with appropriate algorithms and theory. To
  do so, we propose that the random variation is best described via a
  Poisson distribution, which better accounts for the zeros observed in
  the data than the typical assumption of a Gaussian
  distribution. Under a Poisson assumption, we fit a model to observed
  data using the negative log-likelihood score. We present a new
  algorithm for Poisson tensor factorization
  called CANDECOMP--PARAFAC Alternating Poisson Regression
  (CP-APR) that is based on a majorization-minimization approach. It
  can be shown that CP-APR is a generalization of the Lee--Seung
  multiplicative updates. We show how to prevent the algorithm from
  converging to non-KKT points and prove convergence of CP-APR under
  mild conditions. We also explain how to implement CP-APR for
  large-scale sparse tensors and present results on several data sets,
  both real and simulated.
\end{abstract}
