f037847d3f15aae9.tex
1: \begin{abstract}%   <- trailing '%' for backward compatibility of .sty file
2: We consider the problem of learning a non-negative linear classifier with
3: a $1$-norm of at most $k$, and a fixed threshold, under the hinge-loss. This
4: problem generalizes the problem of learning a $k$-monotone disjunction. We
5: prove that we can learn efficiently in this setting, at a rate which is
6: linear in both $k$ and the size of the threshold, and that this is the best
7: possible rate. We provide an efficient online learning algorithm that
8: achieves the optimal rate, and show that in the batch case, empirical risk
9: minimization achieves this rate as well. The rates we show are tighter than the uniform convergence rate, which grows with $k^2$.
10: \end{abstract}
11: