f85260611f888978.tex
1: \begin{abstract}
2:   We consider the problem of sequential sampling from a finite number
3:   of independent statistical populations to maximize the expected
4:   infinite horizon average outcome per period, under a constraint that
5:   the expected average sampling cost does not exceed an upper
6:   bound. The outcome distributions are not known. We construct a class
7:   of consistent adaptive policies, under which the average outcome
8:   converges with probability 1 to the true value under complete information for all
9:   distributions with finite means. We also compare the rate of
10:   convergence for various policies in this class using simulation.
11: \end{abstract}
12: