abstract:c9647eac3a315a15.tex

1: \begin{abstract}

2:     We provide an information-theoretic framework for studying the generalization properties of machine learning algorithms. Our framework ties together existing approaches, including uniform convergence bounds and recent methods for adaptive data analysis.

3:

4:     Specifically, we use Conditional Mutual Information (CMI) to quantify how well the input (i.e., the training data) can be recognized given the output (i.e., the trained model) of the learning algorithm.

5:     We show that bounds on CMI can be obtained from VC dimension, compression schemes, differential privacy, and other methods. We then show that bounded CMI implies various forms of generalization.

6:

7:     %Finally, we show how the CMI framework can be extended to analyze adaptive composition.

8: \end{abstract}

9: