abstract:99ee09716a6f50da.tex

1: \begin{abstract}

2: We propose a risk-averse statistical learning framework wherein the performance of a learning algorithm is evaluated by the conditional value-at-risk (CVaR) of losses rather than the expected loss.

3: We devise algorithms based on stochastic gradient descent for this framework.

4: While existing studies of CVaR optimization require direct access to the underlying distribution, our algorithms make a weaker assumption that only i.i.d.\ samples are given.

5: For convex and Lipschitz loss functions, we show that our algorithm has $O(1/\sqrt{n})$-convergence to the optimal CVaR, where $n$ is the number of samples.

6: For nonconvex and smooth loss functions, we show a  generalization bound on CVaR\@.

7: By conducting numerical experiments on various machine learning tasks, we demonstrate that our algorithms effectively minimize CVaR compared with other baseline algorithms.

8: \end{abstract}

9: