abstract:7eb73a365d924c7a.tex

1: \begin{abstract}

2: Smoothing splines are widely used in nonparametric regression. However, their computational burden is significant when the sample size $n$ is large.

3: When the number of predictors is $d\geq 2$, the computational cost of smoothing splines is of order $O(n^3)$ under the standard approach.

4: Many methods have been developed to approximate smoothing spline estimators using $q$ basis functions instead of $n$, reducing the computational cost to order $O(nq^2)$.

5: These methods are called basis selection methods.

6: Despite their algorithmic benefits, most basis selection methods assume that the sample is uniformly distributed on a hypercube.

7: Their performance may deteriorate when this assumption is not met.

8: To overcome this obstacle, we develop an efficient algorithm that adapts to the unknown probability density function of the predictors.

9: Theoretically, we show that the proposed estimator has the same convergence rate as the full-basis estimator when $q$ is roughly of order $O[n^{2d/\{(pr+1)(d+2)\}}]$, where $p\in[1,2]$ and $r\approx 4$ are constants that depend on the type of spline.

10: Numerical studies on various synthetic datasets demonstrate the superior performance of the proposed estimator in comparison with mainstream competitors.

11: \end{abstract}

12: