\begin{abstract}
Selecting the most appropriate data examples to present to a deep neural
network (DNN) at different stages of training is an unsolved challenge. Though
practitioners typically ignore this problem, a non-trivial data scheduling
method may significantly improve both convergence and generalization
performance. In this paper, we introduce Self-Paced Learning with Adaptive
Deep Visual Embeddings (SPL-ADVisE), a novel end-to-end training protocol that
unites self-paced learning (SPL) and deep metric learning (DML). We leverage
the Magnet Loss to train an \emph{embedding} convolutional neural network (CNN)
to learn a salient representation space. The \emph{student} CNN classifier
dynamically selects similar instance-level training examples to form a
mini-batch, where the \textit{easiness} from the cross-entropy loss and the
\textit{true diverseness} of examples from the learned metric space serve as
sample importance priors. To demonstrate the effectiveness of SPL-ADVisE, we
use deep CNN architectures for the task of supervised image classification on
several coarse- and fine-grained visual recognition datasets. Results show
that, across all datasets, the proposed method converges faster and reaches a
higher final accuracy than other SPL variants, particularly on fine-grained
classes.
\end{abstract}