\begin{abstract}

  We introduce overdispersed black-box variational inference, 
  a method to reduce the variance of the Monte Carlo
  estimator of the gradient in black-box variational inference.
  Instead of taking samples from the variational distribution, we use
  importance sampling to take samples from an overdispersed
  distribution in the same exponential family as the variational
  approximation. Our approach is general since it can be readily
  applied to any exponential family distribution, which is the typical
  choice for the variational approximation. We run experiments
  on two non-conjugate probabilistic models to show
  that our method effectively reduces the variance and that the
  overhead introduced by computing the proposal parameters
  and the importance weights is negligible. We find that
  our overdispersed importance sampling scheme provides lower variance
  than black-box variational inference, even when the latter uses twice
  the number of samples. This results in faster convergence of the
  black-box inference procedure.

\end{abstract}