cc89db2bfba23d97.tex
1: \begin{abstract}
2: We study the problem of dynamic spectrum sensing and access in
3: cognitive radio systems as a partially observed Markov decision
4: process (POMDP). A group of cognitive users cooperatively tries
5: to exploit vacancies in primary (licensed) channels whose
6: occupancies follow a Markovian evolution. We first consider the
7: scenario where the cognitive users have perfect knowledge of
8: the distribution of the signals they receive from the primary
9: users. For this problem, we obtain a greedy channel selection
10: and access policy that maximizes the instantaneous reward,
11: while satisfying a constraint on the probability of interfering
12: with licensed transmissions. We also derive an analytical
13: universal upper bound on the performance of the optimal policy.
14: Through simulation, we show that our scheme achieves good
15: performance relative to the upper bound and improved
16: performance relative to an existing scheme.
17: 
18: %Our scheme also gives better guarantees on synchronization
19: %between the secondary transmitter and receiver, although it
20: %requires a control channel for.
21: 
22: We then consider the more practical scenario where the exact
23: distribution of the signal from the primary is unknown. We
24: assume a parametric model for the distribution and develop an
25: algorithm that can learn the true distribution, still
26: guaranteeing the constraint on the interference probability. We
27: show that this algorithm outperforms the naive design that
28: assumes a worst case value for the parameter. We also provide a
29: proof for the convergence of the learning algorithm.
30: \end{abstract}
31: