87ec73816432f5e4.tex
1: \begin{abstract}
2: Online learning naturally arises in many statistical and machine learning problems. 
3: The most widely used methods in online learning are stochastic first-order algorithms. 
4: Among this family of algorithms, there is a recently developed algorithm, Recursive One-Over-T SGD (ROOT-SGD). 
5: ROOT-SGD is advantageous in that it attains a fast non-asymptotic convergence rate, and its estimator is asymptotically normal. 
6: However, the covariance of this limiting normal distribution is unknown, so the result cannot be applied directly to quantify uncertainty. 
7: To fill this gap, we develop two estimators for the asymptotic covariance of ROOT-SGD. 
8: Our covariance estimators are useful for statistical inference in ROOT-SGD. 
9: Our first estimator follows the plug-in principle. 
10: We substitute each unknown component in the formula for the asymptotic covariance with its empirical counterpart. 
11: The plug-in estimator converges at the rate $\mathcal{O}(1/\sqrt{t})$, where $t$ is the sample size. 
12: Despite its fast convergence, the plug-in estimator relies on the Hessian of the loss function, which might be unavailable in some cases. 
13: Our second estimator is a Hessian-free estimator that overcomes the aforementioned limitation. 
14: The Hessian-free estimator uses the random-scaling technique, and we show that it is a consistent estimator of the true covariance. 
15: \end{abstract}
16: