\begin{abstract}
    In diffusion models, samples are generated through an iterative refinement process, requiring hundreds of sequential model evaluations.
    Several recent methods introduce approximations (fewer discretization steps or distillation) that trade sample quality for speed. In contrast, we introduce Self-Refining Diffusion Samplers (\texttt{SRDS}), which retain sample quality and can improve latency at the cost of additional parallel compute.
    We take inspiration from the Parareal algorithm, a popular numerical method for parallel-in-time integration of differential equations.
    In \texttt{SRDS}, a quick but rough estimate of a sample is first created and then iteratively refined \textit{in parallel} through Parareal iterations.
    \texttt{SRDS} is not only guaranteed to accurately solve the ODE and converge to the serial solution but also benefits from parallelization across the diffusion trajectory,
    enabling batched inference and pipelining. As we demonstrate for pre-trained diffusion models, the early convergence of this refinement procedure drastically reduces the number of steps required to produce a sample, speeding up generation by up to $1.7\times$ on a 25-step StableDiffusion-v2 benchmark and by up to $4.3\times$ on longer trajectories.
\end{abstract}