\begin{abstract}
In online learning, an algorithm plays against an
environment whose losses may be picked by
an adversary at each round.
This framework is general enough to include
problems that are not adversarial, for example
offline optimization or saddle point problems (i.e., min-max optimization).
However, online algorithms are typically not designed
to leverage the additional structure present in non-adversarial problems.
Recently, slight modifications to well-known online
algorithms, such as optimism and adaptive step sizes,
have been used
in several domains to accelerate online learning,
recovering optimal rates in offline smooth optimization
and accelerating convergence to saddle points or social
welfare in smooth games.
In this work we introduce optimism and adaptive step sizes
to Lagrangian hedging, a class of online algorithms
that includes regret-matching and hedge (i.e., multiplicative weights).
Our results include: a general regret bound; a path length regret bound for a fixed
smooth loss, applicable to an optimistic variant of regret-matching and regret-matching+; optimistic regret bounds for $\Phi$-regret,
a framework that includes external, internal, and swap regret;
and optimistic bounds for a family of algorithms that includes
regret-matching+ as a special case.
\end{abstract}