1: \begin{abstract}
2: In online learning an algorithm plays against an
3: environment with losses possibly picked by
4: an adversary at each round.
5: The generality of this framework includes
6: problems that are not adversarial, for example
7: offline optimization, or saddle point problems (i.e. min max optimization).
8: However, online algorithms are typically not designed
9: to leverage additional structure present in non-adversarial problems.
10: Recently, slight modifications to well-known online
11: algorithms such as optimism and adaptive step sizes
12: have been used
13: in several domains to accelerate online learning --
14: recovering optimal rates in offline smooth optimization,
15: and accelerating convergence to saddle points or social
16: welfare in smooth games.
17: In this work we introduce optimism and adaptive stepsizes
18: to Lagrangian hedging, a class of online algorithms
19: that includes regret-matching, and hedge (i.e. multiplicative weights).
20: Our results include: a general general regret bound; a path length regret bound for a fixed
21: smooth loss, applicable to an optimistic variant of regret-matching and regret-matching+; optimistic regret bounds for $\Phi$ regret,
22: a framework that includes external, internal, and swap regret;
23: and optimistic bounds for a family of algorithms that includes
24: regret-matching+ as a special case.
25:
26: \end{abstract}
27: