abstract:3b12d448baf19074.tex

1: \begin{abstract}

2:

3:     Disentanglement is at the forefront of unsupervised learning, as disentangled representations of data improve generalization, interpretability, and performance in downstream tasks.

4:     Current unsupervised approaches remain inapplicable for real-world datasets since they are highly variable in their performance and fail to reach levels of disentanglement of (semi-)supervised approaches.

5:     We introduce population-based training (PBT) for improving consistency in training variational autoencoders (VAEs) and

6:     demonstrate the validity of this approach in a supervised setting (PBT-VAE).

7:     We then use Unsupervised Disentanglement Ranking (UDR) as an unsupervised heuristic to score models in our PBT-VAE training and show how models trained this way tend to consistently disentangle only a subset of the generative factors.

8:     Building on top of this observation we introduce the recursive \rpuVAE{} approach.

9:     We train the model until convergence, remove the learned factors from the dataset and reiterate.

10:     In doing so, we can label subsets of the dataset with the learned factors and consecutively use these labels to train one model that fully disentangles the whole dataset.

11:     With this approach, we show striking improvement in state-of-the-art unsupervised disentanglement performance and robustness across multiple datasets and metrics.

12:

13: \end{abstract}

14: