1: \begin{abstract}
2: Empirical Bayes provides a powerful approach to learning and adapting
3: to latent structure in data. Theory and algorithms for empirical Bayes
4: have a rich literature for sequence models, but are less understood in
5: settings where latent variables and data interact through more complex designs.
6:
7: In this work, we study empirical Bayes estimation of an i.i.d.\ prior in
8: Bayesian linear models, via the nonparametric maximum likelihood
9: estimator (NPMLE). We introduce and study a system of gradient flow equations
10: for optimizing the marginal log-likelihood, jointly
11: over the prior and posterior measures in its Gibbs variational
12: representation using a smoothed reparametrization of the regression
13: coefficients. A diffusion-based implementation yields a Langevin dynamics
14: MCEM algorithm, where the prior law evolves continuously over time to
15: optimize a sequence-model log-likelihood defined by the coordinates
16: of the current Langevin iterate.
17:
18: We show consistency of the NPMLE as $n,p \to \infty$ under mild
19: conditions, including settings of random sub-Gaussian designs when $n \asymp p$.
20: In high noise, we prove a uniform log-Sobolev inequality for the
21: mixing of Langevin dynamics, for possibly misspecified priors and
22: non-log-concave posteriors. We then establish polynomial-time
23: convergence of the joint gradient flow to a near-NPMLE if the marginal
24: negative log-likelihood is convex in a sub-level set of the initialization.
25: \end{abstract}
26: