1: \begin{abstract}
2: We consider concurrent games played on graphs.
3: At every round of a game, each player simultaneously and independently
4: selects a move;
5: the moves jointly determine the transition to a successor state.
6: Two basic objectives are the safety objective to stay forever in a given
7: set of states, and its dual, the reachability objective to reach a given
8: set of states.
9: First, we present a simple proof of the fact that in concurrent reachability
10: games, for all $\vare>0$, memoryless $\vare$-optimal strategies exist.
11: A memoryless strategy is independent of the history of plays, and an
12: $\vare$-optimal strategy achieves the objective with probability within
13: $\vare$ of the value of the game.
14: In contrast to previous proofs of this fact,
15: our proof is more elementary and more combinatorial.
16: Second, we present a strategy-improvement (a.k.a.\ policy-iteration) algorithm
17: for concurrent games with reachability objectives.
18: We then present a strategy-improvement algorithm for concurrent games with
19: safety objectives.
20: Our algorithms yield sequences of player-1 strategies which ensure
21: probabilities of winning that converge monotonically to the value of the game.
22: Our result is significant because the strategy-improvement algorithm
23: for safety games provides, for the first time, a way to approximate the
24: value of a concurrent safety game {\em from below}.
25: Previous methods could approximate the values of these games only from one
26: direction, and as no rates of convergence are known, they did not provide a
27: practical way to solve these games.
28: \end{abstract}