abstract:23c35a3de1b63259.tex

1: \begin{abstract}

2:

3: Directly learning from examples of random difficulty levels is often challenging for both humans and machine learning models.

4: A more effective strategy involves exposing learners to examples in a progressive order, from easy to difficult.

5: Curriculum Learning (CL) has been proposed to implement this strategy in machine learning model training.

6: However, two key challenges persist in CL framework design: defining the difficulty of training data and determining the appropriate amount of data to input at each training step.

7: This paper presents a \modelname~(\modelabbr), drawing inspiration from psychometrics.

8: We quantify the difficulty of training data by applying Item Response Theory (IRT) to responses from Artificial Crowds (AC).

9: This theory-driven IRT-AC approach leads to global (i.e., model-independent) and interpretable difficulty values.

10: Leveraging IRT, we propose a Dynamic Data Selection via Model Ability Estimation (DDS-MAE) strategy to schedule the appropriate amount of data during model training.

11: Since our difficulty labeling and model ability estimation are based on a consistent theory, namely IRT, their values are comparable within the same scope, potentially leading to a faster convergence compared to the other CL methods.

12: Experimental results demonstrate that fine-tuning pre-trained language models with \modelabbr{} enhances their performance on the GLUE benchmark.

13: Moreover, \modelabbr{} surpasses other state-of-the-art (SOTA) CL methods on the GLUE benchmark.

14: We further explore the components of \modelabbr{}, namely the difficulty measurer (IRT-AC) and the training scheduler (DDS-MAE) qualitatively and quantitatively.

15: Lastly, we conduct an ablation study to clarify which components of \modelabbr{} contribute to faster convergence and higher accuracy.

16:

17:

18: \end{abstract}