1: \begin{abstract}
2: Machine learning models have been criticized for reflecting unfair biases in the training data.
3: Instead of solving for this by introducing fair learning algorithms directly, we focus on generating fair synthetic data, such that any downstream learner is fair.
4: Generating fair synthetic data from unfair data--- \emph{while remaining truthful to the underlying data-generating process (DGP)} ---is non-trivial.
5: In this paper, we introduce DECAF: a GAN-based {\it fair} synthetic data generator for tabular data.
6: With DECAF we embed the DGP explicitly as a structural causal model in the input layers of the generator, allowing each variable to be reconstructed conditioned on its causal parents.
7: This procedure enables \textit{inference-time} debiasing, where biased edges can be strategically removed for satisfying user-defined fairness requirements.
8: The DECAF framework is versatile and compatible with several popular definitions of fairness.
9: In our experiments, we show that DECAF successfully removes undesired bias and--- in contrast to existing methods ---is capable of generating high-quality synthetic data.
10: Furthermore, we provide theoretical guarantees on the generator's convergence and the fairness of downstream models.
11: \end{abstract}
12: