cae22fcbbe3db13f.tex
1: \begin{abstract}
2: 	We develop a novel family of algorithms for the online learning setting with regret against any data sequence bounded by the \emph{empirical Rademacher complexity} of that sequence. To develop a general theory of when this type of adaptive regret bound is achievable we establish a connection to the theory of \emph{decoupling inequalities} for martingales in Banach spaces. When the hypothesis class is a set of linear functions bounded in some norm, such a regret bound is achievable if and only if the norm satisfies certain decoupling inequalities for martingales. Donald Burkholder's celebrated \emph{geometric characterization} of decoupling inequalities \citep{burkholder1984boundary} states that such an inequality holds if and only if there exists a special function called a \emph{Burkholder function}  satisfying certain restricted concavity properties. Our online learning algorithms are efficient in terms of queries to this function. 
3: 
4: We realize our general theory by giving novel efficient algorithms for classes including $\ls_p$ norms, Schatten $p$-norms, group norms, 
5: and reproducing kernel Hilbert spaces. The empirical Rademacher complexity regret bound implies --- when used in the i.i.d. setting --- a \emph{data-dependent} complexity bound for excess risk after online-to-batch conversion. 
6: To showcase the power of the empirical Rademacher complexity regret bound, we derive improved rates for a supervised learning generalization of the \emph{online learning with low rank experts} task and for the \emph{online matrix prediction} task.
7: 
8: In addition to obtaining tight data-dependent regret bounds, our algorithms enjoy improved efficiency over previous techniques based on Rademacher complexity, automatically work in the infinite horizon setting, and are scale-free.
9: To obtain such adaptive methods, we introduce novel machinery, and the resulting algorithms are not based on the standard tools of online convex optimization.
10: \end{abstract}
11: