\begin{abstract}
    To sample from an unnormalized probability density function, we propose combining continuous normalizing flows (CNFs) with rejection-resampling steps based on importance weights. We relate the iterative training of CNFs with regularized velocity fields to a Jordan--Kinderlehrer--Otto (JKO) scheme and prove that the involved velocity fields converge to the velocity field of the Wasserstein gradient flow (WGF).
    Alternating local flow steps with non-local rejection-resampling steps makes it possible to overcome local minima and the slow convergence of the WGF for multimodal distributions.
    Since the proposal of the rejection step is generated by the model itself, these steps do not suffer from common drawbacks of classical rejection schemes.
    The resulting model can be trained iteratively, reduces the reverse Kullback--Leibler (KL) loss function in each step, generates \textit{iid} samples, and permits evaluation of the density of the generated samples.
    Numerical examples show that our method yields accurate results on various test distributions, including high-dimensional multimodal targets, and significantly outperforms the state of the art in almost all cases.
\end{abstract}