b09372462df27c56.tex
1: \begin{abstract}
2:   We study the concept of fairness for linear bandit problems.  We
3:   extend the framework of meritocratic fairness for finite action
4:   bandit problems introduced by~\citet{JKMR16} by relaxing several
5:   assumptions made in that work.  First, we need not assume the
6:   individuals our algorithms choose between are segmented into
7:   (preassigned) groups, nor that exactly one member of each group
8:   arrives in each round. We further extend the framework to consider
9:   algorithms which may select a varying number of individuals each
10:   round. In this more general framework, we then design a class of
11:   algorithms whose regret is comparable with the guarantees of the
12:   simpler framework.  Second, we introduce a definition of fairness
13:   for infinite action bandit problems.  We provide a fair algorithm
14:   for infinite linear bandit problems with an instance-dependent
15:   regret guarantee and match it with an almost-tight
16:   instance-dependent regret lower bound.
17: \end{abstract}
18: