\begin{abstract}
  We propose Batch-Expansion Training (BET), a framework for running a
  batch optimizer on a gradually expanding dataset. Unlike stochastic
  approaches, BET does not need to resample batches i.i.d. at every
  iteration, which makes it more resource efficient in a distributed
  setting and when disk access is constrained. Moreover, BET can be
  easily paired with most batch optimizers, does not require any
  parameter tuning, and compares favorably to existing stochastic and
  batch methods. We show that when the batch size grows exponentially
  with the number of outer iterations, BET achieves the optimal
  $\Ocalt(1/\epsilon)$ data-access convergence rate for strongly convex
  objectives. Experiments in parallel and distributed settings show
  that BET performs better than standard batch and stochastic
  approaches.
\end{abstract}