82cd0953372c20b6.tex
1: \begin{abstract}
2: 	Non-smooth optimization is a core ingredient of many imaging or machine learning pipelines. 
3: 	%
4: 	Non-smoothness encodes structural constraints on the solutions, such as sparsity, group sparsity, low-rank and sharp edges.
5: 	It is also the basis for the definition of robust loss functions and scale-free functionals such as square-root Lasso.  
6: 	%
7: 	Standard approaches to deal with non-smoothness leverage either proximal splitting or coordinate descent. These approaches are effective but usually require parameter tuning, preconditioning or some sort of support pruning.  
8: 	%
9: 	In this work, we advocate and study a different route, which operates a non-convex but smooth over-parametrization of the underlying non-smooth optimization problems. This generalizes quadratic variational forms that are at the heart of the popular Iterative Reweighted Least Squares (IRLS).
10: 	%
11: 	Our main theoretical contribution connects gradient descent on this reformulation to a mirror descent flow with a varying Hessian metric. This analysis is crucial to derive convergence bounds that are dimension-free. This explains the efficiency of the method when using small grid sizes in imaging.
12: 	%
13: 	Our main algorithmic contribution is to apply the Variable Projection (VarPro) method which defines a new formulation by explicitly minimizing over part of the variables. This leads to a better conditioning of the minimized functional and improves the convergence of simple but very efficient gradient-based methods, for instance quasi-Newton solvers. 
14: 	%
15: 	We exemplify the use of this new solver for the resolution of regularized regression problems for inverse problems and supervised learning, including  total variation prior and non-convex regularizers. 
16: 	%
17: \keywords{Sparsity \and low-rank \and compressed sensing \and variable projection \and mirror descent \and non-convex optimization}
18: \subclass{68Q25, 68R10, 68U05}
19: \end{abstract}
20: