abstract:3b76069e091e2de6.tex

1: \begin{abstract}

2: We introduce TrustVI, a fast second-order algorithm for black-box variational

3: inference based on trust-region optimization and the ``reparameterization trick.''

4: At each iteration, TrustVI proposes and assesses a step based on

5: minibatches of draws from the variational distribution.

6: The algorithm provably converges to a stationary point.

7: %To prove convergence to a stationary point, we show that 1) in expectation,

8: %TrustVI's iterates improve the value of a surrogate objective function in

9: %proportion to the trust region size, and 2) the trust region radius cannot

10: %remain arbitrarily small indefinitely if the gradient of the objective at the

11: %iterates is bounded away from zero.

12: We implemented TrustVI in the Stan framework and compared it to two alternatives: Automatic Differentiation Variational Inference (ADVI) and

13: Hessian-free Stochastic Gradient Variational Inference (HFSGVI).

14: The former is based on stochastic first-order optimization.

15: The latter uses second-order information, but lacks convergence guarantees.

16: TrustVI typically converged at least one order of magnitude faster than ADVI,

17: demonstrating the value of stochastic second-order information.

18: TrustVI often found substantially better variational distributions than

19: HFSGVI, demonstrating that our convergence theory can matter in practice.

20: \end{abstract}

21: