e24f8c0462298be1.tex
1: \begin{abstract}
2:     We consider the problem of asynchronous stochastic optimization, where
3:     an optimization algorithm makes updates based on stale stochastic gradients of the objective that are subject to an arbitrary (possibly adversarial) sequence of delays.
4:     We present a procedure which, for any given $q \in (0,1]$, transforms any standard stochastic first-order method to an asynchronous method with convergence guarantee depending on the $q$-quantile delay of the sequence.
5:     This approach leads to convergence rates of the form $O(\delayqq/qT+\sigma/\sqrt{qT})$ for non-convex and $O(\delayqq^2/(q T)^2+\sigma/\sqrt{qT})$ for convex smooth problems, where $\delayqq$ is the $q$-quantile delay, generalizing and improving on existing results that depend on the average delay.
6:     We further show a method that 
7:     automatically adapts to all quantiles simultaneously, without any prior knowledge of the delays, achieving convergence rates of the form $O(\inf_{q} \delayqq/qT+\sigma/\sqrt{qT})$ for non-convex and $O(\inf_{q} \delayqq^2/(q T)^2+\sigma/\sqrt{qT})$ for convex smooth problems.
8:     Our technique is based on asynchronous mini-batching with a careful batch-size selection and filtering of stale gradients.
9: \end{abstract}
10: