1: \begin{abstract}
2: We develop an approach to risk minimization and stochastic optimization that
3: provides a convex surrogate for variance, allowing near-optimal and
4: computationally efficient trading between approximation and estimation
5: error. Our approach builds off of techniques for distributionally robust
6: optimization and Owen's empirical likelihood, and we provide a number of
7: finite-sample and asymptotic results characterizing the theoretical
8: performance of the estimator. In particular, we show that our procedure
9: comes with certificates of optimality, achieving (in some scenarios)
10: faster rates of convergence than empirical risk minimization
11: by virtue of automatically balancing bias and variance. We
12: give corroborating empirical evidence showing that in practice, the
13: estimator indeed trades between variance and absolute performance on a
14: training sample, improving out-of-sample (test) performance over standard
15: empirical risk minimization for a number of classification problems.
16: \end{abstract}
17: