abstract:3657a8f461731b0e.tex

1: \begin{abstract}

2: Stochastic gradient algorithms estimate the gradient based on only one or a few samples

3: and enjoy low computational cost per iteration. They have been widely used in large-scale optimization problems.

4: However, stochastic gradient algorithms are usually slow to converge and achieve sub-linear convergence rates,

5: due to the inherent variance in the gradient computation. To accelerate the convergence,

6: some variance-reduced stochastic gradient algorithms, e.g., proximal stochastic variance-reduced gradient (Prox-SVRG) algorithm,

7: have recently been proposed to solve strongly convex problems. Under the strongly convex condition,

8: these variance-reduced stochastic gradient algorithms achieve a linear convergence rate. However,

9: many machine learning problems are convex but not strongly convex. In this paper,

10: we introduce Prox-SVRG and its projected variant called Variance-Reduced Projected Stochastic Gradient (VRPSG) to

11: solve a class of non-strongly convex optimization problems widely used in machine learning. As the main technical contribution of this paper,

12: we show that both VRPSG and Prox-SVRG achieve a linear convergence rate without strong convexity.

13: A key ingredient in our proof is a Semi-Strongly Convex (SSC) inequality which is the first to be rigorously proved for a class of non-strongly convex problems in both constrained

14: and regularized settings. Moreover, the SSC inequality is independent of algorithms and may be applied to analyze other stochastic gradient algorithms

15: besides VRPSG and Prox-SVRG, which may be of independent interest.

16: To the best of our knowledge, this is the first work that establishes the linear convergence rate for the

17: variance-reduced stochastic gradient algorithms on solving both constrained and regularized problems without strong convexity.

18: \end{abstract}

19: