\begin{abstract}
In this work, we demonstrate how differentiable stochastic sampling techniques developed in the context of deep reinforcement learning can be used to perform efficient parameter inference over stochastic, simulation-based forward models. As a particular example, we focus on the problem of estimating the parameters of Halo Occupancy Distribution (HOD) models, which are used to connect galaxies with their dark matter halos. Using a combination of continuous relaxation and gradient re-parameterisation techniques, we obtain well-defined gradients with respect to the HOD parameters through discrete galaxy catalog realisations. Access to these gradients allows us to leverage efficient sampling schemes, such as Hamiltonian Monte Carlo, and substantially speed up parameter inference.
%Using the Gumbel-Softmax approach, we can map the discrete HOD models to a continuous distribution with well defined derivatives, allowing the use of first order optimization methods, such as Hamiltonian Monte Carlo, for analysis of data.
We demonstrate our technique on a mock galaxy catalog generated from the Bolshoi simulation using the
\cite{2007zheng} %Zheng et al. 2007
HOD model and find %,finding
posteriors nearly identical to those obtained with standard Markov chain Monte Carlo techniques, with a $\sim 8\times$ increase in convergence efficiency.
Our differentiable HOD model %This model
also has broad applications in full forward-modelling approaches to cosmic structure and cosmological analysis. \github
\end{abstract}