1: \begin{abstract}
2: Smoothing splines have been used pervasively in nonparametric regressions. However, the computational burden of smoothing splines is significant when the sample size $n$ is large.
3: When the number of predictors $d\geq2$, the computational cost for smoothing splines is at the order of $O(n^3)$ using the standard approach.
4: Many methods have been developed to approximate smoothing spline estimators by using $q$ basis functions instead of $n$ ones, resulting in a computational cost of the order $O(nq^2)$.
5: These methods are called the basis selection methods.
6: Despite algorithmic benefits, most of the basis selection methods require the assumption that the sample is uniformly-distributed on a hyper-cube.
7: These methods may have deteriorating performance when such an assumption is not met.
8: To overcome the obstacle, we develop an efficient algorithm that is adaptive to the unknown probability density function of the predictors.
9: Theoretically, we show the proposed estimator has the same convergence rate as the full-basis estimator when $q$ is roughly at the order of $O[n^{2d/\{(pr+1)(d+2)\}}]$, where $p\in[1,2]$ and $r\approx 4$ are some constants depend on the type of the spline.
10: Numerical studies on various synthetic datasets demonstrate the superior performance of the proposed estimator in comparison with mainstream competitors.
11: \end{abstract}
12: