1: \begin{abstract}
2: %intro:
3: % Large pretrained language models are expensive to train but useful
4: %formalization
5: We propose a Distributional Approach to address Controlled Text Generation from pre-trained language models (LMs). This view makes it possible to define pointwise, distributional, or hybrid constraints in a single framework (to our knowledge, the first approach with such generality) while minimizing KL divergence from the initial LM distribution.
6: The optimal target distribution is then uniquely determined, resulting in an explicit EBM (Energy-Based Model) representation.
7: % DPG
8: From this representation we then train the desired controlled autoregressive model through a KL-Adaptive DPG (Distributional Policy Gradient), an adaptive distributional variant of REINFORCE.
9: % empirical
10: We conduct a first set of experiments over pointwise constraints, showing the advantages of our approach over a set of baselines in obtaining a controlled LM that balances constraint satisfaction against divergence from GPT-2.
11: We then conduct distributional and hybrid experiments
12: %(for which we are not aware of relevant baselines)
13: demonstrating the potential of our approach as a remedy to the well-known problem of Bias in Language Models.
14: %
15: Through an ablation study, we show that our adaptive technique is effective at obtaining faster convergence.
16: \end{abstract}
17: