1: \begin{abstract}
2: We establish a tight characterization of the worst-case rates for
3: the excess risk of agnostic learning with sample compression schemes
4: and for uniform convergence for agnostic sample compression schemes.
5: In particular, we find that the optimal rates of convergence
6: for size-$k$ agnostic sample compression schemes are of the form
7: $\sqrt{\frac{k \log(n/k)}{n}}$, which contrasts with agnostic learning
8: with classes of VC dimension $k$, where the optimal rates are of the form
9: $\sqrt{\frac{k}{n}}$.
10: \end{abstract}
11: