1: \begin{abstract} We consider a distributionally robust formulation of stochastic optimization problems arising in statistical learning, where robustness is with respect to uncertainty in the underlying data distribution. Our formulation builds on risk-averse optimization techniques and the theory of coherent risk measures. It uses semi-deviation risk for quantifying uncertainty, allowing us to compute solutions that are robust against perturbations in the population data distribution. { We consider a broad class of generalized differentiable loss functions that can be non-convex and non-smooth, involving upward and downward cusps, and we develop an efficient stochastic subgradient method for distributionally robust problems with such functions. We prove that it converges to a point satisfying the optimality conditions. To our knowledge, this is the first method with rigorous convergence guarantees in the context of generalized differentiable non-convex and non-smooth distributionally robust stochastic optimization. Our method allows for control of the desired level of robustness with little extra computational cost compared to population risk minimization \mg{with stochastic gradient methods}.} We also illustrate the performance of our algorithm on real datasets arising in convex and non-convex supervised learning problems. %logistic regression and deep learning tasks. %has a similar computational and statistical complexity compared to the stochastic gradient method applied to the risk minimization problem. %To our knowledge, it is the first method with provable convergence guarantees for solving a distributionally robust formulation of a population minimization problem.
2: \end{abstract}
3: