\begin{abstract}
Denoising diffusion models are a
recent class of generative models exhibiting state-of-the-art performance in
image and audio synthesis. Such models approximate the time-reversal of a
forward noising process from a target distribution to a
reference density, which is usually Gaussian. Despite their strong empirical results,
the theoretical analysis of such models remains limited. In particular, all
current approaches crucially assume that the target distribution admits a density
w.r.t.\ the Lebesgue measure. This does not cover settings where the target
distribution is supported on a lower-dimensional manifold or is given by some
empirical distribution. In this paper, we bridge this gap by providing the
first convergence results for diffusion models in this more general setting. In
particular, we provide quantitative bounds on the Wasserstein distance of
order one between the target data distribution and the generative distribution of the
diffusion model.
\end{abstract}