
\begin{abstract}
  Data subject to heavy-tailed errors are commonly encountered in various scientific fields,
  especially in the modern era with the explosion of massive data.  To address this problem,
  procedures based on quantile regression and least absolute deviation (LAD) regression have been
  developed in recent years.  These methods essentially estimate the conditional median (or
  quantile) function, which can differ substantially from the conditional mean function when the
  error distribution is asymmetric and heteroscedastic.  How can we efficiently estimate the mean
  regression function in the ultra-high dimensional setting when only the second moment exists?  To
  solve this problem, we propose a penalized Huber loss with a diverging parameter to reduce the
  bias created by the traditional Huber loss.  The resulting penalized robust approximate quadratic
  (RA-quadratic) loss is called RA-Lasso.  In the ultra-high dimensional setting, where the
  dimensionality can grow exponentially with the sample size, our results reveal that the RA-Lasso
  estimator is consistent and attains the same rate as the optimal rate under light-tailed errors.
  We further study the computational convergence of RA-Lasso and show that the composite gradient
  descent algorithm indeed produces a solution that admits the same optimal rate after sufficiently
  many iterations.  As a byproduct, we also establish a concentration inequality for estimating the
  population mean when only the second moment exists.  We compare RA-Lasso with other regularized
  robust estimators based on quantile regression and LAD regression.  Extensive simulation studies
  demonstrate the satisfactory finite-sample performance of RA-Lasso.
\end{abstract}
