6f7c0473bef4c39f.tex
1: \begin{abstract}
2:     \noindent \uclip{} is a simple amendment to gradient clipping
3:     that can be applied to any iterative gradient optimization
4:     algorithm. Like regular clipping, \uclip{} involves using
5:     gradients that are clipped to a prescribed size
6:     (e.g.~with component wise or norm based clipping) 
7:     but instead of discarding the clipped portion of the gradient,
8:     \uclip{} maintains a buffer of these values that is added to the gradients
9:     on the next iteration (before clipping).
10:     We show that the cumulative bias of the~\uclip{} updates is bounded by a constant.
11:     This implies that the clipped updates are unbiased \emph{on average}.
12:     Convergence follows via a lemma that guarantees
13:     convergence with updates $u_i$ as long as
14:     $\sum_{i=1}^t (u_i - g_i) = o(t)$ where $g_i$ are the gradients.
15:     Extensive experimental exploration is performed on CIFAR10 with further validation
16:     given on ImageNet.
17: \end{abstract}
18: