1: \begin{abstract}
2: We present an approach towards convex optimization that relies on a novel scheme which converts online adaptive algorithms into offline methods.
3: In the offline optimization setting, our derived methods are shown to obtain favourable adaptive guarantees which depend on the \emph{harmonic sum} of the queried gradients. We further show that our methods implicitly adapt to the objective's structure:
4: in the smooth case fast convergence rates are ensured without any prior knowledge of the smoothness parameter, while still maintaining guarantees in the non-smooth setting.
5: %This contrasts with line-search GD (gradient descent) procedures which do not hold guarantees in the non-smooth setting.
6: %require in to know advance whether the problem is smooth/non-smooth.
7: % Our derived method implicitly adapts to the objective's structure, attaining adaptive bounds which depend on the harmonic sum of gradient
8: %
9: %
10: % attaining fast convergence rates without any prior knowledge of the smoothness.
11: %This contrasts with the GD (gradient descent) algorithm which requires the smoothness parameter in order to obtain such fast rates.
12: %These rates are comparable to the ones attained by the GD (gradient descent) algorithm had it known the smoothness parameter.
13: Our approach has a natural extension to the stochastic setting, resulting in a lazy version of SGD (stochastic GD), where minibathces are chosen \emph{adaptively} depending on the magnitude of the gradients. Thus providing a principled approach towards choosing minibatch sizes.
14: \end{abstract}