1: \begin{abstract}
2: Data subject to heavy-tailed errors are commonly encountered in various scientific fields,
3: especially in the modern era with the explosion of massive data. To address this problem, procedures
4: based on quantile regression and least absolute deviation (LAD) regression have been developed in
5: recent years. These methods essentially estimate the conditional median (or quantile) function,
6: which can be very different from the conditional mean function when the distributions are asymmetric and heteroscedastic. How can we efficiently estimate
7: the mean regression function in the ultra-high dimensional setting when only the second
8: moment exists? To solve this problem, we propose a penalized Huber loss with a diverging parameter to
9: reduce biases created by the traditional Huber loss. Such a penalized robust approximate
10: quadratic (RA-quadratic) loss will be called
11: RA-Lasso. In the ultra-high dimensional setting, where the dimensionality can grow exponentially
12: with the sample size, our results reveal that the RA-Lasso estimator is consistent,
13: converging at the same rate as the optimal rate under the light-tail situation. We further study
14: the computational convergence of RA-Lasso and show that the composite gradient descent algorithm
15: indeed produces a solution that admits the same optimal rate after sufficient iterations. As a
16: byproduct, we also establish a concentration inequality for estimating the population mean when
17: only the second moment exists. We compare RA-Lasso with other regularized robust
18: estimators based on quantile regression and LAD regression. Extensive simulation studies
19: demonstrate the satisfactory finite-sample performance of RA-Lasso.
20: \end{abstract}
21: