1: \begin{abstract}
2:
3: Directly learning from examples of random difficulty levels is often challenging for both humans and machine learning models.
4: A more effective strategy involves exposing learners to examples in a progressive order, from easy to difficult.
5: Curriculum Learning (CL) has been proposed to implement this strategy in machine learning model training.
6: However, two key challenges persist in CL framework design: defining the difficulty of training data and determining the appropriate amount of data to input at each training step.
7: This paper presents a \modelname~(\modelabbr), drawing inspiration from psychometrics.
8: We quantify the difficulty of training data by applying Item Response Theory (IRT) to responses from Artificial Crowds (AC).
9: This theory-driven IRT-AC approach leads to global (i.e., model-independent) and interpretable difficulty values.
10: Leveraging IRT, we propose a Dynamic Data Selection via Model Ability Estimation (DDS-MAE) strategy to schedule the appropriate amount of data during model training.
11: Because our difficulty labeling and model ability estimation both rest on the same theory, namely IRT, their values lie on a common scale and are directly comparable, potentially leading to faster convergence than other CL methods.
12: Experimental results demonstrate that fine-tuning pre-trained language models with \modelabbr{} enhances their performance on the GLUE benchmark.
13: Moreover, \modelabbr{} surpasses other state-of-the-art (SOTA) CL methods on the GLUE benchmark.
14: We further examine the two components of \modelabbr{}, namely the difficulty measurer (IRT-AC) and the training scheduler (DDS-MAE), both qualitatively and quantitatively.
15: Lastly, we conduct an ablation study to clarify which components of \modelabbr{} contribute to faster convergence and higher accuracy.
16:
17:
18: \end{abstract}