\begin{abstract}
The sparse group lasso optimization problem is solved using a coordinate gradient descent
algorithm. The algorithm is applicable to a broad class of convex loss functions.
Convergence of the algorithm is established, and the algorithm is used to investigate the
performance of the multinomial sparse group lasso classifier. On three different real data
examples the multinomial group lasso clearly outperforms multinomial lasso in terms of
achieved classification error rate and in terms of including fewer features for the
classification. The run-time of our sparse group lasso implementation is of the same order of
magnitude as the multinomial lasso algorithm implemented in the R package glmnet. Our
implementation scales well with the problem size. One of the high-dimensional examples
considered is a 50-class classification problem with 10k features, which amounts to
estimating 500k parameters. The implementation is available as the R package msgl.
\end{abstract}