93431d665ccca2f0.tex
1: \begin{abstract}
2:   Denoising diffusion models  are a
3:   recent class of generative models exhibiting state-of-the-art performance in
4:   image and audio synthesis. Such models approximate the time-reversal of a
5:   forward noising process from a target distribution to a
6:   reference density, which is usually Gaussian. Despite their strong empirical results,
7:   the theoretical analysis of such models remains limited. In particular, all
8:   current approaches crucially assume that the target density admits a density
9:   w.r.t.\ the Lebesgue measure. This does not cover settings where the target
10:   distribution is supported on a lower-dimensional manifold or is given by some
11:   empirical distribution. In this paper, we bridge this gap by providing the
12:   first convergence results for diffusion models in this more general setting. In
13:   particular, we provide quantitative bounds on the Wasserstein distance of
14:   order one between the target data distribution and the generative distribution of the
15:   diffusion model.
16: \end{abstract}
17: