12545e79df75d83c.tex
1: \begin{abstract}
2:   Denoising is intuitively related to projection. Indeed, under the manifold
3:   hypothesis, adding random noise is approximately equivalent to orthogonal
4:   perturbation. Hence, learning to denoise is approximately learning to project.
5:   In this paper, we use this observation to reinterpret denoising diffusion
6:   models as approximate gradient descent applied to the Euclidean distance
7:   function. We then provide straight-forward convergence analysis of
8:   the DDIM sampler under simple assumptions on the projection-error of the
9:  denoiser.
10:   Finally, we propose a new sampler based on two simple modifications to DDIM
11:   using insights from our theoretical results. In as few as 5-10 function
12:   evaluations, our sampler achieves state-of-the-art FID scores on pretrained
13:   CIFAR-10 and CelebA models and can generate high quality samples on latent
14:   diffusion models.
15: \end{abstract}
16: