3cfc6925f910e1b0.tex
1: \begin{abstract} 
2:   We propose a bilevel optimization strategy for selecting the best hyperparameter value for the nonsmooth $\ell_p$ regularizer with $0<p\le 1$. The concerned bilevel optimization problem has a nonsmooth, possibly nonconvex, $\ell_p$-regularized problem as the lower-level problem. Despite the recent popularity of nonconvex $\ell_p$ regularizer and  the usefulness of bilevel optimization for selecting hyperparameters, algorithms for such bilevel problems have not been studied %in both machine learning and mathematical optimization fields
3:   because of the difficulty of $\ell_p$ regularizer.
4:   %For solving the bilevel optimization problem,
5:   We first show new optimality conditions for such bilevel optimization problems and then propose a smoothing-type algorithm together with convergence analysis. The proposed algorithm is simple and scalable as
6:   our numerical comparison to Bayesian optimization and grid search
7:   indicates. %through comparison
8: {It is a promising algorithm  for nonsmooth nonconvex bilevel optimization problems as the first algorithm with convergence guarantee.} 
9: \end{abstract}