abstract:5f33e08f3dac9e60.tex

1: \begin{abstract}

2:   We study acquisition functions for active learning (AL) for text classification.

3:   The Expected Loss Reduction (ELR) method focuses on a Bayesian estimate of the reduction in classification error,

4:   and has recently been updated with the Mean Objective Cost of Uncertainty (MOCU).

5:   We adapt the ELR framework to estimate the increase in (strictly proper) scores such as log probability or negative mean square error,

6:   an approach we call Bayesian Estimate of Mean Proper Scores (BEMPS\footnote{Our implementation of BEMPS can be downloaded from \url{https://github.com/davidtw999/BEMPS}.}).

7:   We also prove convergence results, borrowing techniques used with MOCU.

8:   To allow better experimentation with the new acquisition functions,

9:   we develop a complementary batch AL algorithm

10:   that encourages diversity in the vector of expected changes in scores for unlabelled data.

11:   % Further, we develop a pool filtering approach that provides initial screening of the unlabelled pool to further speed up AL with large pools

12:   To allow high-performance text classifiers,

13:   we combine ensembling and dynamic validation set construction on pretrained language models.

14:   Extensive experimental evaluation then explores how these different acquisition functions

15: %   and the implementation speedups

16:   perform.

17:   The results show that using mean square error and log probability with BEMPS yields robust acquisition functions that consistently outperform the others tested.

18: \aside{Moreover, our implementation allows this class of estimation-based acquisition functions to be scaled to large pool sizes of 100,000.}

19: \end{abstract}

20: