\begin{abstract}
  We develop an approach to risk minimization and stochastic optimization that
  provides a convex surrogate for variance, allowing near-optimal and
  computationally efficient trading between approximation and estimation
  error. Our approach builds off of techniques for distributionally robust
  optimization and Owen's empirical likelihood, and we provide a number of
  finite-sample and asymptotic results characterizing the theoretical
  performance of the estimator. In particular, we show that our procedure
  comes with certificates of optimality, achieving (in some scenarios)
  faster rates of convergence than empirical risk minimization
  by virtue of automatically balancing bias and variance. We
  give corroborating empirical evidence showing that in practice, the
  estimator indeed trades between variance and absolute performance on a
  training sample, improving out-of-sample (test) performance over standard
  empirical risk minimization for a number of classification problems.
\end{abstract}