\begin{abstract}
We propose a bilevel optimization strategy for selecting the best hyperparameter value for the nonsmooth $\ell_p$ regularizer with $0<p\le 1$. The bilevel optimization problem considered here has a nonsmooth, possibly nonconvex, $\ell_p$-regularized problem as its lower-level problem. Despite the recent popularity of nonconvex $\ell_p$ regularizers and the usefulness of bilevel optimization for selecting hyperparameters, algorithms for such bilevel problems have not been studied %in both machine learning and mathematical optimization fields
because of the difficulty posed by the $\ell_p$ regularizer.
%For solving the bilevel optimization problem,
We first derive new optimality conditions for such bilevel optimization problems and then propose a smoothing-type algorithm together with its convergence analysis. The proposed algorithm is simple and scalable, as
our numerical comparison with Bayesian optimization and grid search
indicates. %through comparison
{As the first algorithm with a convergence guarantee, it is a promising approach to nonsmooth nonconvex bilevel optimization problems.}
\end{abstract}