\begin{abstract}

We introduce overdispersed black-box variational inference,
a method to reduce the variance of the Monte Carlo
estimator of the gradient in black-box variational inference.
Instead of taking samples from the variational distribution, we use
importance sampling to take samples from an overdispersed
distribution in the same exponential family as the variational
approximation. Our approach is general since it can be readily
applied to any exponential family distribution, which is the typical
choice for the variational approximation. We run experiments
on two non-conjugate probabilistic models to show
that our method effectively reduces the variance and that the
overhead introduced by the computation of the proposal parameters
and the importance weights is negligible. We find that
our overdispersed importance sampling scheme provides lower variance
than black-box variational inference, even when the latter uses twice
the number of samples. This results in faster convergence of the
black-box inference procedure.

\end{abstract}