abstract:7ba85f99b7e250c1.tex

1: \begin{abstract}

2: We present Diffusion Model Patching (DMP), a simple method to boost the performance of pre-trained diffusion models that have \textit{already reached convergence}, with a negligible increase in parameters.

3: DMP inserts a small, learnable set of prompts into the model's input space while keeping the original model frozen.

4: The effectiveness of DMP is not merely due to the addition of parameters but stems from its dynamic gating mechanism, which selects and combines a subset of learnable prompts at every step of the generative process (\ie, reverse denoising steps).

5: This strategy, which we term ``mixture-of-prompts'', enables the model to draw on the distinct expertise of each prompt, essentially ``patching'' the model's functionality at every step with minimal yet specialized parameters.

6: Uniquely, DMP improves the model by further training on the same dataset on which it was originally trained, a setting in which significant gains are typically not expected because the model has already converged.

7: Experiments show that DMP improves the FID of a converged DiT-L/2 on FFHQ 256$\times$256 by 10.38\%, with only a 1.43\% parameter increase and 50K additional training iterations.

8: % Our findings further demonstrate DMP's broad applicability across various model architectures and sizes, as well as its effectiveness in diverse generative tasks.

9: \end{abstract}

10: