abstract:0658aa98d7969fa9.tex

1: \begin{abstract}

2: This paper shows how universal learning can be achieved with

3: expert advice. To this aim, we specify an experts algorithm with

4: the following characteristics: (a) it uses only feedback from the

5: actions actually chosen (bandit setup), (b) it can be applied with

6: countably infinite expert classes, and (c) it copes with losses

7: that may grow in time appropriately slowly. We prove loss bounds

8: against an adaptive adversary. From this, we obtain a master

9: algorithm for ``reactive" experts problems, which means that the

10: master's actions may influence the behavior of the adversary. Our

11: algorithm can significantly outperform standard experts algorithms

12: on such problems. Finally, we combine it with a universal expert

13: class. The resulting universal learner performs -- in a certain

14: sense -- almost as well as any computable strategy, for any online

15: decision problem. We also specify the (worst-case) convergence

16: speed, which is very slow.

17: \iftecrep

18:

19: {\bf Keywords.} Prediction with expert advice, responsive

20: environments, partial observation game, bandits, universal

21: learning, asymptotic optimality.

22: \fi

23: \end{abstract}

24: