c140046c7ccf868f.tex
1: \begin{abstract}
2: The multi-index model is a simple yet powerful high-dimensional regression model
3: which circumvents the curse of dimensionality
4: assuming $ \bbE [ Y | X ] = g(A^\top X) $
5: for some unknown index space $A$ and link function $g$.
6: In this paper we introduce a method for the estimation of the index space,
7: and study the propagation error of an index space estimate in the regression of the link function.
8: The proposed method approximates the index space
9: by the span of linear regression slope coefficients computed over level sets of the data.
10: Being based on ordinary least squares,
11: our approach is easy to implement and computationally efficient.
12: We prove a tight concentration bound that shows $N^{-1/2}$-convergence,
13: but also faithfully describes the dependence on the chosen partition of level sets,
14: hence giving indications on the hyperparameter tuning.
15: The estimator's competitiveness is confirmed
16: by extensive comparisons with state-of-the-art methods, both on synthetic and real data sets.
17: As a second contribution,
18: we establish minimax optimal generalization bounds for k-nearest neighbors and piecewise polynomial regression
19: when trained on samples projected onto any $N^{-1/2}$-consistent estimate of the index space,
20: thus providing complete and provable estimation of the multi-index model.
21: \end{abstract}
22: