abstract:6128750bf84f9452.tex

1: \begin{abstract}

2: We consider concurrent games played on graphs.

3: At every round of a game, each player simultaneously and independently

4: selects a move;

5: the moves jointly determine the transition to a successor state.

6: Two basic objectives are the safety objective to stay forever in a given

7: set of states, and its dual, the reachability objective to reach a given

8: set of states.

9: First, we present a simple proof of the fact that in concurrent reachability

10: games, for all $\vare>0$, memoryless $\vare$-optimal strategies exist.

11: A memoryless strategy is independent of the history of plays, and an

12: $\vare$-optimal strategy achieves the objective with probability within

13: $\vare$ of the value of the game.

14: In contrast to previous proofs of this fact,

15: our proof is more elementary and more combinatorial.

16: Second, we present a strategy-improvement (a.k.a.\ policy-iteration) algorithm

17: for concurrent games with reachability objectives.

18: We then present a strategy-improvement algorithm for concurrent games with

19: safety objectives.

20: Our algorithms yield sequences of player-1 strategies which ensure

21: probabilities of winning that converge monotonically to the value of the game.

22: Our result is significant because the strategy-improvement algorithm

23: for safety games provides, for the first time, a way to approximate the

24: value of a concurrent safety game {\em from below}.

25: Previous methods could approximate the values of these games only from one

26: direction, and as no rates of convergence are known, they did not provide a

27: practical way to solve these games.

28: \end{abstract}