0658aa98d7969fa9.tex
1: \begin{abstract}
2: This paper shows how universal learning can be achieved with
3: expert advice. To this aim, we specify an experts algorithm with
4: the following characteristics: (a) it uses only feedback from the
5: actions actually chosen (bandit setup), (b) it can be applied with
6: countably infinite expert classes, and (c) it copes with losses
7: that may grow in time appropriately slowly. We prove loss bounds
8: against an adaptive adversary. From this, we obtain a master
9: algorithm for ``reactive" experts problems, which means that the
10: master's actions may influence the behavior of the adversary. Our
11: algorithm can significantly outperform standard experts algorithms
12: on such problems. Finally, we combine it with a universal expert
13: class. The resulting universal learner performs -- in a certain
14: sense -- almost as well as any computable strategy, for any online
15: decision problem. We also specify the (worst-case) convergence
16: speed, which is very slow.
17: \iftecrep
18: 
19: {\bf Keywords.} Prediction with expert advice, responsive
20: environments, partial observation game, bandits, universal
21: learning, asymptotic optimality.
22: \fi
23: \end{abstract}
24: