\begin{abstract}
We propose a randomized first-order optimization method, \texttt{SEGA} (SkEtched GrAdient), which builds a variance-reduced estimate of the gradient progressively over its iterations from random linear measurements (sketches) of the gradient obtained from an oracle. In each iteration, \texttt{SEGA} updates the current estimate of the gradient through a sketch-and-project operation using the information provided by the latest sketch; the updated estimate is then used to compute an unbiased estimate of the true gradient through a random relaxation procedure. This unbiased estimate is in turn used to perform a gradient step. Unlike standard subspace descent methods, such as coordinate descent, \texttt{SEGA} can be used for optimization problems with a {\em non-separable} proximal term. We provide a general convergence analysis and prove linear convergence for strongly convex objectives. In the special case of coordinate sketches, \texttt{SEGA} can be enhanced with various techniques such as {\em importance sampling}, {\em minibatching} and {\em acceleration}, and its rate matches the best-known rate of coordinate descent up to a small constant factor.
\end{abstract}
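
As a rough illustration of the three steps described in the abstract, consider the special case of uniform coordinate sketches applied to minimizing $f(x) + R(x)$ over $n$ variables. The notation below is assumed for this sketch only and need not match the notation used elsewhere: $h^k$ is the running gradient estimate, $g^k$ the unbiased gradient estimator, $\alpha > 0$ a stepsize, $e_i$ the $i$-th unit coordinate vector, and $\nabla_i f$ the $i$-th partial derivative of $f$. With $i$ sampled uniformly at random from $\{1, \dots, n\}$, one iteration would read
\[
h^{k+1} = h^k + \bigl(\nabla_i f(x^k) - h^k_i\bigr) e_i ,
\]
\[
g^k = h^k + n \bigl(\nabla_i f(x^k) - h^k_i\bigr) e_i ,
\]
\[
x^{k+1} = \mathrm{prox}_{\alpha R}\bigl(x^k - \alpha g^k\bigr) .
\]
The first line is the sketch-and-project update, which overwrites only the sampled coordinate of the estimate. The second is the relaxation step; under uniform sampling it is unbiased because
\[
\mathrm{E}\bigl[g^k\bigr] = h^k + n \cdot \frac{1}{n} \sum_{i=1}^{n} \bigl(\nabla_i f(x^k) - h^k_i\bigr) e_i = \nabla f(x^k) .
\]
The third is a proximal gradient step. Since $g^k$ is a full vector rather than a single coordinate, the proximal term $R$ is not required to be separable in this step.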