abstract:802d567611cf5178.tex

1: \begin{abstract}

2:  Bayesian nonparametric (BNP) models provide elegant methods for discovering underlying latent features within a data set, but inference in such models can be slow. We exploit the fact that completely random measures, which commonly-used models like the Dirichlet process and the beta-Bernoulli process can be expressed using, are decomposable into independent sub-measures.   We use this decomposition to partition the latent measure into a finite measure containing only instantiated components, and an infinite measure containing all other components. We then select different inference algorithms for the two components: uncollapsed samplers mix well on the finite measure, while collapsed samplers mix well on the infinite, sparsely occupied tail. The resulting hybrid algorithm can be applied to a wide class of models, and can be easily distributed to allow scalable inference without sacrificing asymptotic convergence guarantees.

3:  \blfootnote{$*$ denotes equal contribution}

4:   %We exploit the fact that most commonly used BNP models, including the Dirichlet process and the beta-Bernoulli process, can be expressed in terms of

5:   %In this paper, we provide the hybrid sampler, and its distributed variants for a multitude of BNP models, including, beta-Bernoulli process, Dirichlet process, Hierarchical Dirichlet process and Pitman-Yor process. %\avi{I also want to say that we are the first to do beta-Bernoulli on 1 Million images, but politely}

6: \end{abstract}

7: