\begin{abstract}
High-dimensional data analysis has been an active research area, with variable selection and dimension reduction as its main focuses. In practice, the variables often lie on an unknown, lower-dimensional nonlinear manifold. Under this manifold assumption, this paper has two aims: regression and gradient estimation on the manifold, and the development of a new tool for manifold learning.
For the first aim, we suggest directly reducing the dimensionality to the intrinsic dimension $d$ of the manifold and performing the popular local linear regression (LLR) on a tangent plane estimate. An immediate consequence is a dramatic reduction in computation time when the ambient space dimension $p\gg d$.
We provide a rigorous theoretical justification of the convergence of the proposed regression and gradient estimators by carefully analyzing the curvature, boundary, and non-uniform sampling effects. A bandwidth selector that can handle heteroscedastic errors is proposed.
For the second aim, we carefully analyze the behavior of our regression estimator both in the interior and near the boundary of the manifold, and make explicit its relationship with manifold learning, in particular with estimating the Laplace-Beltrami operator of the manifold. In this context, we also make clear that it is important to use a smaller bandwidth in the tangent plane estimation than in the LLR. %Numerical results show our regression estimator outperforms existing methods in terms of both computational speed and estimation accuracy.
Simulation studies and the Isomap face data example are used to illustrate the computational speed and estimation accuracy of our methods.
\end{abstract}