6128750bf84f9452.tex
1: \begin{abstract}
2: We consider concurrent games played on graphs.  
3: At every round of a game, each player simultaneously and independently 
4: selects a move;
5: the moves jointly determine the transition to a successor state. 
6: Two basic objectives are the safety objective to stay forever in a given 
7: set of states, and its dual, the reachability objective to reach a given 
8: set of states.
9: First, we present a simple proof of the fact that in concurrent reachability 
10: games, for all $\vare>0$, memoryless $\vare$-optimal strategies exist.  
11: A memoryless strategy is independent of the history of plays, and an 
12: $\vare$-optimal strategy achieves the objective with probability within 
13: $\vare$ of the value of the game.  
14: In contrast to previous proofs of this fact, 
15: our proof is more elementary and more combinatorial.  
16: Second, we present a strategy-improvement (a.k.a.\ policy-iteration) algorithm 
17: for concurrent games with reachability objectives. 
18: We then present a strategy-improvement algorithm for concurrent games with 
19: safety objectives. 
20: Our algorithms yield sequences of player-1 strategies which ensure
21: probabilities of winning that converge monotonically to the value of the game. 
22: Our result is significant because the strategy-improvement algorithm 
23: for safety games provides, for the first time, a way to approximate the 
24: value of a concurrent safety game {\em from below}.
25: Previous methods could approximate the values of these games only from one 
26: direction, and as no rates of convergence are known, they did not provide a
27: practical way to solve these games.
28: \end{abstract}