\begin{abstract}
We examine the rate of convergence of the Lasso estimator of lower-dimensional
components of the high-dimensional parameter. Under bounds on the $\ell_1$-norm
of the worst possible sub-direction, these rates are of order $\sqrt{|J| \log p / n}$,
where $p$ is the total number of parameters, $J \subset \{ 1 , \ldots , p \}$ represents
a subset of the parameters, and
$n$ is the number of observations. We also derive rates in sup-norm
in terms of the rate of convergence in $\ell_1$-norm. The irrepresentable condition
on a set $J$ requires that the $\ell_1$-norm of the worst possible sub-direction is
sufficiently smaller than one.
In that case sharp oracle results can be obtained. Moreover, if the
coefficients in $J$ are small enough, the Lasso will
set these coefficients to zero. This extends known results which say that
the irrepresentable condition on the inactive set
(the set where coefficients are exactly zero) implies no false positives.
We further show that by de-sparsifying one obtains fast rates in supremum
norm without conditions on the worst possible sub-direction.
The main assumption here is that approximate
sparsity is of order $o(\sqrt{n} / \log p)$.
The results are extended to M-estimation with $\ell_1$-penalty, for example for generalized linear models
and exponential families. For the graphical Lasso this leads to an
extension of known results to the case where the precision matrix is only approximately
sparse. The bounds we provide are non-asymptotic, but we also present asymptotic
formulations for ease of interpretation.
\end{abstract}