1: \begin{abstract}
2: In this paper we study the problem of recovering a structured but unknown parameter $\vct{\theta}^*$ from $n$ nonlinear observations of the form $y_i=f(\langle \vct{x}_i,\vct{\theta}^*\rangle)$ for $i=1,2,\ldots,n$. We develop a framework for characterizing time-data tradeoffs for a variety of parameter estimation algorithms when the nonlinear function $f$ is unknown. This framework includes many popular heuristics such as projected/proximal gradient descent and stochastic schemes. For example, we show that a projected gradient descent scheme converges at a linear rate to a reliable solution with a near minimal number of samples. We provide a sharp characterization of the convergence rate of such algorithms as a function of sample size, amount of a-prior knowledge available about the parameter and a measure of the nonlinearity of the function $f$. These results provide a precise understanding of the various tradeoffs involved between statistical and computational resources as well as a-prior side information available for such nonlinear parameter estimation problems.
3:
4: %We consider a general model in which the aim is to estimate an unknown parameter $\bbeta$ from noisy and possibly nonlinear observations $f(\X\bbeta)$. In statistics this is known as the single index model and it encompasses several models including lasso, logistic regression and more recently it finds applications in one-bit compressed sensing. In this work, we introduce fast iterative algorithms that achieves linear convergence to the underlying parameter. In important instances including projected gradient descent, we achieve optimal estimation error rates in terms of the nonlinearity and the problem geometry.
5: \end{abstract}
6: