abstract:90dd53c72cb2db7d.tex

1: \begin{abstract}%

2: We consider stochastic approximation for the least squares regression problem in the non-strongly convex setting.

3: %

4: We present the first practical algorithm that achieves the optimal prediction error rates in terms of dependence on the noise of the problem, as $O(d/t)$ while accelerating the forgetting of the initial conditions to $O(d/t^2)$.

5: %

6: Our new algorithm is based on a simple modification of the accelerated gradient descent.

7: %

8: We provide convergence results for both the averaged and the last iterate of the algorithm.

9: %

10: In order to describe the tightness of these new bounds, we present a matching lower bound in the noiseless setting  and thus show the optimality of our algorithm.

11: \end{abstract}

12: