\begin{abstract}
%intro: 
% Large pretrained language models are expensive to train but useful 
%formalization
We propose a Distributional Approach to Controlled Text Generation from pre-trained language models (LMs). This view makes it possible to define pointwise, distributional, or hybrid constraints in a single framework (to our knowledge, the first approach with such generality) while minimizing the KL divergence from the initial LM distribution. 
The optimal target distribution is then uniquely determined and admits an explicit Energy-Based Model (EBM) representation.
% DPG
From this representation, we then train the desired controlled autoregressive model using KL-Adaptive DPG (Distributional Policy Gradient), an adaptive distributional variant of REINFORCE. 
% empirical 
We conduct a first set of experiments on pointwise constraints, showing the advantages of our approach over a set of baselines in obtaining a controlled LM that balances constraint satisfaction against divergence from GPT-2.  
We then conduct distributional and hybrid experiments 
%(for which we are not aware of relevant baselines)  
demonstrating the potential of our approach as a remedy to the well-known problem of bias in language models. 
%
Through an ablation study, we show the effectiveness of our adaptive technique in achieving faster convergence. 
\end{abstract}