abstract:cae22fcbbe3db13f.tex

1: \begin{abstract}

2: 	We develop a novel family of algorithms for the online learning setting with regret against any data sequence bounded by the \emph{empirical Rademacher complexity} of that sequence. To develop a general theory of when this type of adaptive regret bound is achievable we establish a connection to the theory of \emph{decoupling inequalities} for martingales in Banach spaces. When the hypothesis class is a set of linear functions bounded in some norm, such a regret bound is achievable if and only if the norm satisfies certain decoupling inequalities for martingales. Donald Burkholder's celebrated \emph{geometric characterization} of decoupling inequalities \citep{burkholder1984boundary} states that such an inequality holds if and only if there exists a special function called a \emph{Burkholder function}  satisfying certain restricted concavity properties. Our online learning algorithms are efficient in terms of queries to this function.

3:

4: We realize our general theory by giving novel efficient algorithms for classes including $\ls_p$ norms, Schatten $p$-norms, group norms,

5: and reproducing kernel Hilbert spaces. The empirical Rademacher complexity regret bound implies --- when used in the i.i.d. setting --- a \emph{data-dependent} complexity bound for excess risk after online-to-batch conversion.

6: To showcase the power of the empirical Rademacher complexity regret bound, we derive improved rates for a supervised learning generalization of the \emph{online learning with low rank experts} task and for the \emph{online matrix prediction} task.

7:

8: In addition to obtaining tight data-dependent regret bounds, our algorithms enjoy improved efficiency over previous techniques based on Rademacher complexity, automatically work in the infinite horizon setting, and are scale-free.

9: To obtain such adaptive methods, we introduce novel machinery, and the resulting algorithms are not based on the standard tools of online convex optimization.

10: \end{abstract}

11: