1: \begin{abstract}
2: Black-box variational inference tries to approximate a complex target
3: distribution through a gradient-based optimization of the parameters
4: of a simpler distribution. Provable convergence guarantees require
5: structural properties of the objective. This paper shows that for
6: location-scale family approximations, if the target is M-Lipschitz
7: smooth, then so is the ``energy'' part of the variational objective.
8: The key proof idea is to describe gradients in a certain inner-product
9: space, thus permitting the use of Bessel's inequality. This result
10: gives bounds on the location of the optimal parameters, and is a key
11: ingredient for convergence guarantees.
12: \end{abstract}
13: