\begin{abstract}

We formulate the problem of dynamic spectrum sensing and access
in cognitive radio systems as a partially observed Markov
decision process (POMDP). A group of cognitive users
cooperatively tries to exploit vacancies in primary (licensed)
channels whose occupancies follow a Markovian evolution. We
first consider the scenario where the cognitive users have
perfect knowledge of the distribution of the signals they
receive from the primary users. For this problem, we obtain a
greedy channel selection and access policy that maximizes the
instantaneous reward while satisfying a constraint on the
probability of interfering with licensed transmissions. We also
derive an analytical universal upper bound on the performance
of the optimal policy. Through simulation, we show that our
scheme achieves good performance relative to the upper bound
and improved performance relative to an existing scheme.

%Our scheme also gives better guarantees on synchronization
%between the secondary transmitter and receiver, although it
%requires a control channel for.

We then consider the more practical scenario where the exact
distribution of the signal from the primary users is unknown.
We assume a parametric model for the distribution and develop
an algorithm that learns the true distribution while still
satisfying the constraint on the interference probability. We
show that this algorithm outperforms the naive design that
assumes a worst-case value for the parameter. We also prove
that the learning algorithm converges.

\end{abstract}