\begin{abstract}
  We study a perturbed version of the proximal gradient algorithm in which the
  gradient is not available in closed form and must be approximated. We study
  the convergence and derive non-asymptotic bounds on the convergence rate of
  the perturbed proximal gradient algorithm, a perturbed averaged version of
  the proximal gradient algorithm, and a perturbed version of the fast
  iterative shrinkage-thresholding algorithm (FISTA) of
  \cite{becketteboulle09}. When the gradient is approximated by Monte Carlo
  methods, we derive conditions on the Monte Carlo batch size and the
  step size of the algorithm under which convergence is guaranteed. In
  particular, we show that the Monte Carlo approximations of some averaged
  proximal gradient algorithms and a Monte Carlo approximation of FISTA achieve
  the same convergence rates as their deterministic counterparts. To
  illustrate, we apply the algorithms to high-dimensional generalized linear
  mixed models with $\ell_1$-penalization.
\end{abstract}