3b12d448baf19074.tex
1: \begin{abstract}
2: 
3:     Disentanglement is at the forefront of unsupervised learning, as disentangled representations of data improve generalization, interpretability, and performance in downstream tasks.
4:     Current unsupervised approaches remain inapplicable for real-world datasets since they are highly variable in their performance and fail to reach levels of disentanglement of (semi-)supervised approaches.
5:     We introduce population-based training (PBT) for improving consistency in training variational autoencoders (VAEs) and
6:     demonstrate the validity of this approach in a supervised setting (PBT-VAE).
7:     We then use Unsupervised Disentanglement Ranking (UDR) as an unsupervised heuristic to score models in our PBT-VAE training and show how models trained this way tend to consistently disentangle only a subset of the generative factors.
8:     Building on top of this observation we introduce the recursive \rpuVAE{} approach.
9:     We train the model until convergence, remove the learned factors from the dataset and reiterate.
10:     In doing so, we can label subsets of the dataset with the learned factors and consecutively use these labels to train one model that fully disentangles the whole dataset.
11:     With this approach, we show striking improvement in state-of-the-art unsupervised disentanglement performance and robustness across multiple datasets and metrics.
12:     
13: \end{abstract}
14: