7750a180e9d58afa.tex
1: \begin{abstract}
2:     Unbiased label noise is frequently assumed in the convergence analysis for statistical learning of generalized linear models. When performing linear regression, the natural decomposition of the noise and the true loss ensures the consistency of Ordinary Least Square (OLS) estimator with decent asymptotic rates. In this work, we demonstrate the implicit regularization effects of running SGD for OLS estimator with unbiased label noises under mini-batch sampling settings. 
3:     %In this work, we analyze the asymptotic convergence of deep learning, where the training data is assumed being drawn from a deep neural network with random weights with (unbiased) noisy labels. Our work demonstrate that the decomposition of true parameter and the noise might not always exist in deep learning settings, while the unbiased noise would cause bias of OLS estimator for deep neural networks from the perspectives of stochastic gradients. Blessed by such bias, deep learning however could enjoy certain privilege caused by the implicit regularization effects. Compared to the existing work, I am the first to reconsider the bias and implicit regularization effects of unbiased label noise to the deep learning with OLS estimator. The simulation study backs up our theories. 
4: \end{abstract}
5: