abstract:9462ff087c6aa60e.tex

1: \begin{abstract}

2: We investigate and provide new insights on the sampling rule called Top-Two Thompson Sampling (\TTTS). In particular, we justify its use for \emph{fixed-confidence best-arm identification}. We further propose a variant of \TTTS called Top-Two Transportation Cost (\TCC), which disposes of the computational burden of \TTTS. As our main contribution, we provide the first sample complexity analysis of \TTTS and \TCC when coupled with a very natural Bayesian stopping rule, for bandits with Gaussian rewards, solving one of the open questions raised by ~\citet{russo2016ttts}.

3: %We can further see that the proposed stopping rule is indeed closely related to the usual Chernoff stopping rule.

4: We also provide new posterior convergence results for \TTTS under two models that are commonly used in practice: bandits with Gaussian and Bernoulli rewards and conjugate priors.

5: \end{abstract}

6: