1: \begin{abstract}
2: As the number of samples and dimensionality of optimization problems related
3: to \GGCorrRev{statistic}{statistics} and machine learning explode, block
4: coordinate descent algorithms have gained popularity \GGCorr{since they
5: are able to}{since they} reduce the original problem to several smaller ones.
6: Coordinates to be optimized are usually selected randomly according
7: to a given probability distribution. \GGCorr{In this work, we}{We} introduce an importance
8: sampling strategy that helps randomized coordinate descent algorithms
9: focus on blocks that are still far from convergence. The framework
10: applies to problems composed of the sum of two possibly non-convex
11: terms, one being separable and non-smooth. We compare our algorithm to
12: a full-gradient proximal approach, to a randomized block coordinate
13: algorithm with uniform sampling, and to cyclic block
14: coordinate descent. \GGCorr{Our
15: experimental results on toy and real-world problems,}{Experimental results} show the clear benefit of using an importance sampling strategy.
16: \end{abstract}
17: