\begin{abstract}
We establish a tight characterization of the worst-case rates for
the excess risk of agnostic learning with sample compression schemes
and for uniform convergence for agnostic sample compression schemes.
In particular, we find that the optimal rates of convergence
for size-$k$ agnostic sample compression schemes are of the form
$\sqrt{\frac{k \log(n/k)}{n}}$, which contrasts with agnostic learning
with classes of VC dimension $k$, where the optimal rates are of the form
$\sqrt{\frac{k}{n}}$.
\end{abstract}