abstract:562d59003f477a11.tex

1: \begin{abstract}

2:

3:   We introduce overdispersed black-box variational inference,

4:   a method to reduce the variance of the Monte Carlo

5:   estimator of the gradient in black-box variational inference.

6:   Instead of taking samples from the variational distribution, we use

7:   importance sampling to take samples from an overdispersed

8:   distribution in the same exponential family as the variational

9:   approximation. Our approach is general since it can be readily

10:   applied to any exponential family distribution, which is the typical

11:   choice for the variational approximation. We run experiments

12:   on two non-conjugate probabilistic models to show

13:   that our method effectively reduces the variance, and the

14:   overhead introduced by the computation of the proposal parameters

15:   and the importance weights is negligible. We find that

16:   our overdispersed importance sampling scheme provides lower variance

17:   than black-box variational inference, even when the latter uses twice

18:   the number of samples. This results in faster convergence of the

19:   black-box inference procedure.

20:

21: \end{abstract}