38c29b813b4afdc5.tex
1: \begin{abstract}
2: We study, both in theory and in practice, the use of momentum in classic iterative hard thresholding (IHT) methods.
3: By simply adding a momentum step to classical IHT, we investigate the convergence behavior of the resulting accelerated method on convex optimization criteria with non-convex constraints, under standard assumptions.
4: We observe that acceleration in IHT leads to significant improvements over state-of-the-art projected gradient descent and Frank-Wolfe variants.
5: As a byproduct of our analysis, we study the impact of the choice of momentum parameter: as in convex settings, we observe two modes of behavior, ``rippling'' and linear, depending on the level of momentum.
6: \end{abstract}
7: