abstract:f44165e007ec90a9.tex

1: \begin{abstract}

2: Empirical Bayes provides a powerful approach to learning and adapting

3: to latent structure in data. Theory and algorithms for empirical Bayes

4: have a rich literature for sequence models, but are less understood in

5: settings where latent variables and data interact through more complex designs.

6:

7: In this work, we study empirical Bayes estimation of an i.i.d.\ prior in

8: Bayesian linear models, via the nonparametric maximum likelihood

9: estimator (NPMLE). We introduce and study a system of gradient flow equations

10: for optimizing the marginal log-likelihood, jointly

11: over the prior and posterior measures in its Gibbs variational

12: representation using a smoothed reparametrization of the regression

13: coefficients. A diffusion-based implementation yields a Langevin dynamics

14: MCEM algorithm, where the prior law evolves continuously over time to

15: optimize a sequence-model log-likelihood defined by the coordinates

16: of the current Langevin iterate.

17:

18: We show consistency of the NPMLE as $n,p \to \infty$ under mild

19: conditions, including settings of random sub-Gaussian designs when $n \asymp p$.

20: In high noise, we prove a uniform log-Sobolev inequality for the

21: mixing of Langevin dynamics, for possibly misspecified priors and

22: non-log-concave posteriors. We then establish polynomial-time

23: convergence of the joint gradient flow to a near-NPMLE if the marginal

24: negative log-likelihood is convex in a sub-level set of the initialization.

25: \end{abstract}

26: