abstract:a0d9c931f670d212.tex

1: \begin{abstract}

2:     We develop a Levenberg-Marquardt method for minimizing the sum of a smooth nonlinear least-squares term \(f(x) = \tfrac{1}{2} \|F(x)\|_2^2\) and a nonsmooth term \(h\).

3:     Both \(f\) and \(h\) may be nonconvex.

4:     Steps are computed by minimizing the sum of a regularized linear least-squares model and a model of \(h\) using a first-order method such as the proximal gradient method.

5:     We establish global convergence to a first-order stationary point of both a trust-region and a regularization variant of the Levenberg-Marquardt method under the assumptions that \(F\) and its Jacobian are Lipschitz continuous and \(h\) is proper and lower semi-continuous.

6:     In the worst case, both methods perform \(O(\epsilon^{-2})\) iterations to bring a measure of stationarity below \(\epsilon \in (0, 1)\).

7:     We report numerical results on three examples: a group-lasso basis-pursuit denoise example, a nonlinear support vector machine, and parameter estimation in neuron firing.

8:     For those examples to be implementable, we describe in detail how to evaluate proximal operators for separable \(h\) and for the group lasso with trust-region constraint.

9:     In all cases, the Levenberg-Marquardt methods perform fewer outer iterations than a proximal-gradient method with adaptive step length and a quasi-Newton trust-region method, neither of which exploit the least-squares structure of the problem.

10:     Our results also highlight the need for more sophisticated subproblem solvers than simple first-order methods.

11:   \end{abstract}

12: