\begin{abstract}
We consider the problem of sampling from a multimodal distribution with a Markov chain, given a small number of samples from the stationary measure.
Although mixing can be arbitrarily slow, we show that if the Markov chain has a $k$th-order spectral gap,
initializing the chain from a set of
$\tilde O(k/\varepsilon^2)$ samples from the stationary distribution will, with high probability over the samples, efficiently generate a sample whose conditional law is $\varepsilon$-close in TV distance to the stationary measure.
In particular, this applies to mixtures of $k$ distributions satisfying a Poincar\'e inequality, with faster convergence when they satisfy a log-Sobolev inequality.
Our bounds are stable under perturbations of the Markov chain, and in particular hold for Langevin diffusion over $\mathbb R^d$ with score estimation error, as well as for Glauber dynamics combined with approximation error from pseudolikelihood estimation.
This justifies the success of data-based initialization for score matching methods despite slow mixing for the data distribution. It also improves and generalizes the results of \cite{koehler2023sampling}, giving linear rather than exponential dependence on $k$ and applying to arbitrary semigroups. As a consequence of our results, we show for the first time that a natural class of low-complexity Ising measures can be efficiently learned from samples.
\end{abstract}