1: \begin{abstract}
2:
3: Disentanglement is at the forefront of unsupervised learning, as disentangled representations of data improve generalization, interpretability, and performance in downstream tasks.
4: Current unsupervised approaches remain inapplicable to real-world datasets because their performance is highly variable and they fail to reach the levels of disentanglement achieved by (semi-)supervised approaches.
5: We introduce population-based training (PBT) for improving consistency in training variational autoencoders (VAEs) and
6: demonstrate the validity of this approach in a supervised setting (PBT-VAE).
7: We then use Unsupervised Disentanglement Ranking (UDR) as an unsupervised heuristic to score models in our PBT-VAE training and show that models trained this way tend to consistently disentangle only a subset of the generative factors.
8: Building on this observation, we introduce the recursive \rpuVAE{} approach.
9: We train the model until convergence, remove the learned factors from the dataset, and repeat.
10: In doing so, we can label subsets of the dataset with the learned factors and subsequently use these labels to train a single model that fully disentangles the whole dataset.
11: With this approach, we show a striking improvement over the state of the art in unsupervised disentanglement performance and robustness across multiple datasets and metrics.
12:
13: \end{abstract}
14: