\begin{abstract}
  We study the concept of fairness for linear bandit problems.  We
  extend the framework of meritocratic fairness for finite action
  bandit problems introduced by~\citet{JKMR16} by relaxing several
  assumptions made in that work.  First, we need not assume the
  individuals our algorithms choose between are segmented into
  (preassigned) groups, nor that exactly one member of each group
  arrives in each round. We further extend the framework to consider
  algorithms which may select a varying number of individuals each
  round. In this more general framework, we then design a class of
  algorithms whose regret is comparable with the guarantees of the
  simpler framework.  Second, we introduce a definition of fairness
  for infinite action bandit problems.  We provide a fair algorithm
  for infinite linear bandit problems with an instance-dependent
  regret guarantee and match it with an almost-tight
  instance-dependent regret lower bound.
\end{abstract}