1: \begin{abstract}
2: There is a clear need for efficient algorithms to tune hyperparameters for statistical learning schemes, since the commonly applied search methods (such as grid search with N-fold cross-validation) are inefficient and/or approximate. Previously existing algorithms that efficiently search for hyperparameters relying on the smoothness of the cost function cannot be applied in problems such as Lasso regression.
3: % On the other hand, methods that train a neural network to predict the weights given the hyperparameters are interesting; however in problems such as linear or kernel regression, these methods incur in heavy over-parameterization since the number of network parameters becomes larger than the dimensionality of the regression weights and the hyperparameters together.
4: In this contribution, we develop a hyperparameter optimization method that relies on the structure of proximal gradient methods and does not require a smooth cost function. Such a method is applied to Leave-one-out (LOO)-validated Lasso and Group Lasso to yield efficient, data-driven, hyperparameter optimization algorithms.
5: %Moreover, with slight modifications, the method can be turned into an online algorithm capable of dealing with sequential data, amenable for learning from high-dimensional time series and big data.
6: Numerical experiments corroborate the convergence of the proposed method to a local optimum of the LOO validation error curve, and the efficiency of its approximations.
7:
8: \end{abstract}
9: