abstract:36520b4fb21b3caf.tex

1: \begin{abstract}

2: %

3: In stochastic optimization,

4: the population risk is generally approximated by the empirical risk.

5: However, in the large-scale setting, minimization of the empirical risk may be

6: computationally restrictive.

7: In this paper, we design

8: an efficient algorithm to approximate the population risk minimizer

9: in generalized linear problems such as

10: binary classification with surrogate losses and generalized linear regression models.

11: We focus on large-scale problems,

12: where the iterative minimization of the empirical risk is computationally intractable, i.e.,

13: the number of observations $n$

14: is much larger than the dimension of the parameter $p$, i.e. $n \gg p \gg 1$.

15: %

16: We show that under random sub-Gaussian design,

17: the true minimizer of the population risk is approximately proportional to the

18: corresponding ordinary least squares (OLS) estimator.

19: Using this relation,

20: we design an algorithm that achieves the same accuracy as

21: the empirical risk minimizer

22: through iterations that

23: attain up to a cubic convergence rate,

24: and that are cheaper than

25: any batch optimization algorithm by at least a factor of $\mathcal{O}(p)$.

26: %

27: We provide theoretical guarantees for our algorithm,

28: and analyze the convergence behavior in terms of data dimensions.

29: %

30: Finally, we demonstrate the performance of

31: our algorithm on well-known classification and regression problems,

32: through extensive numerical studies

33: on large-scale datasets, and show that

34: it achieves the highest performance compared to several other widely

35: used and specialized optimization algorithms.

36: %%

37: \end{abstract}

38: