\begin{abstract}
We propose a risk-averse statistical learning framework in which the performance of a learning algorithm is evaluated by the conditional value-at-risk (CVaR) of the losses rather than the expected loss.
We devise algorithms based on stochastic gradient descent for this framework.
While existing studies of CVaR optimization require direct access to the underlying distribution, our algorithms rely on the weaker assumption that only i.i.d.\ samples are available.
For convex and Lipschitz loss functions, we show that our algorithm converges to the optimal CVaR at rate $O(1/\sqrt{n})$, where $n$ is the number of samples.
For nonconvex and smooth loss functions, we show a generalization bound on CVaR\@.
By conducting numerical experiments on various machine learning tasks, we demonstrate that our algorithms minimize CVaR more effectively than baseline algorithms.
\end{abstract}
