\begin{abstract}
%Geometric inductive biases such as spatial curvature, factorizability, or equivariance can aid learning of representations that better reflect the latent structure of a dataset, which in turn can improve generalization in downstream tasks.
Incorporating geometric inductive biases into models can aid interpretability and generalization, but encoding to a specific geometric structure can be challenging because of the topological constraints it imposes. In this paper, we theoretically and empirically characterize obstructions to training encoders with geometric latent spaces. We show that local optima can arise from singularities (e.g.,~self-intersections) or from an incorrect degree or winding number. We then discuss how normalizing flows can potentially circumvent these obstructions by defining multimodal variational distributions. Inspired by this observation, we propose a new flow-based model that maps data points to multimodal distributions over geometric spaces, and we empirically evaluate it on two domains. We observe improved stability during training and a higher chance of converging to a homeomorphic encoder.
%which in turn improves the stability of training and convergence to a homeomorphic mapping. We perform empirical evaluations in 2 domains, which demonstrate that flow-based models succeed at circumventing the identified optimization obstructions.
\end{abstract}