\begin{abstract}
%intro:
% Large pretrained language models are expensive to train but useful
%formalization

We propose a Distributional Approach to Controlled Text Generation from pre-trained Language Models (LMs). This approach makes it possible to define pointwise, distributional, or hybrid constraints in a single framework (to our knowledge, it is the first approach with such generality) while minimizing the KL divergence from the initial LM distribution.
The optimal target distribution is then uniquely determined and admits an explicit Energy-Based Model (EBM) representation.

% DPG
From this representation, we then train the desired controlled autoregressive model using KL-Adaptive Distributional Policy Gradient (DPG), an adaptive distributional variant of REINFORCE.

% empirical
We conduct a first set of experiments with pointwise constraints, showing the advantages of our approach over a set of baselines in obtaining a controlled LM that balances constraint satisfaction against divergence from GPT-2.

We then conduct experiments with distributional and hybrid constraints,
%(for which we are not aware of relevant baselines)
demonstrating the potential of our approach as a remedy to the well-known problem of bias in language models.

%
Through an ablation study, we show that our adaptive technique is effective at obtaining faster convergence.
\end{abstract}
