1: \begin{abstract}
2: This paper aims to conduct a comprehensive theoretical analysis of current diffusion models. We introduce a novel generative learning methodology utilizing the Schr{\"o}dinger bridge diffusion model in latent space as the framework for theoretical exploration in this domain. Our approach commences with the pre-training of an encoder-decoder architecture using data originating from a distribution that may diverge from the target distribution, thus facilitating the accommodation of a large sample size through the utilization of pre-existing large-scale models. Subsequently, we develop a diffusion model within the latent space utilizing the Schr{\"o}dinger bridge framework.
3: Our theoretical analysis encompasses the establishment of end-to-end error analysis for learning distributions via the latent Schr{\"o}dinger bridge diffusion model. Specifically, we control the second-order Wasserstein distance between the generated distribution and the target distribution. Furthermore, our obtained convergence rates effectively mitigate the curse of dimensionality, offering robust theoretical support for prevailing diffusion models.
4:
5:
6:
7:
8: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
9: \vspace{0.5cm} \noindent{\bf KEY WORDS}:
10: Diffusion models, Schr{\"o}dinger bridge, Encoder-decoder, Curse of dimensionality,
11: End-to-end error analysis.
12: \end{abstract}
13: