
1: \begin{abstract}

2: We present an approach to convex optimization that relies on a novel scheme which converts online adaptive algorithms into offline methods.

3:  In the offline optimization setting, our derived methods are shown to obtain  favourable adaptive guarantees which depend on the \emph{harmonic sum} of the queried gradients.  We further show that our methods implicitly adapt to the objective's structure:

4: in the smooth case, fast convergence rates are ensured without any prior knowledge of the smoothness parameter, while guarantees are still maintained in the non-smooth setting.

5: %This contrasts with line-search GD (gradient descent) procedures which do not hold guarantees in the non-smooth setting.

6: %require knowing in advance whether the problem is smooth/non-smooth.

7: % Our derived method implicitly adapts to the objective's structure, attaining adaptive bounds which depend on the harmonic sum of gradient

8: %

9: %

10: % attaining  fast convergence rates without any prior knowledge of the smoothness.

11: %This contrasts with the GD (gradient descent) algorithm which requires the smoothness parameter in order to obtain such fast rates.

12: %These rates are comparable to the ones attained by the GD (gradient descent) algorithm had it known the smoothness parameter.

13: Our approach has a natural extension to the stochastic setting, resulting in a lazy version of SGD (stochastic gradient descent), where minibatches are chosen \emph{adaptively} depending on the magnitude of the gradients, thus providing a principled approach towards choosing minibatch sizes.

14: \end{abstract}