1: \begin{abstract}
2: We examine the rate of convergence of the Lasso estimator of lower dimensional
3: components of the high-dimensional parameter. Under bounds on the $\ell_1$-norm
4: on the worst possible sub-direction these rates are of order $\sqrt {|J| \log p / n }$
5: where $p$ is the total number of parameters, $J \subset \{ 1 , \ldots , p \}$ represents
6: a subset of the parameters and
7: $n$ is the number of observations. We also derive rates in sup-norm
8: in terms of the rate of convergence in $\ell_1$-norm. The irrepresentable condition
9: on a set $J$ requires that the $\ell_1$-norm of the worst possible sub-direction is
10: sufficiently smaller than one.
11: In that case sharp oracle results can be obtained. Moreover, if the
12: coefficients in $J$ are small enough the Lasso will
13: put these coefficients to zero. This extends known results which say that
14: the irrepresentable condition on the inactive set
15: (the set where coefficients are exactly zero) implies no false positives.
16: We further show that by de-sparsifying one obtains fast rates in supremum
17: norm without conditions on the worst possible sub-direction.
18: The main assumption here is that approximate
19: sparsity is of order $o (\sqrt n / \log p )$.
20: The results are extended to M-estimation with $\ell_1$-penalty for generalized linear models
21: and exponential families for example. For the graphical Lasso this leads to an
22: extension of known results to the case where the precision matrix is only approximately
23: sparse. The bounds we provide are non-asymptotic but we also present asymptotic
24: formulations for ease of interpretation.
25: \end{abstract}
26: