abstract:108f8806407fd103.tex

1: \begin{abstract}

2: %In this paper, we focus on solving two-player zero-sum extensive games with imperfect information.

3: Counterfactual regret minimization (CFR) is the most popular algorithm for solving two-player zero-sum extensive games with imperfect information and %\junz{summary the key problems of CFR we want to solve.}

4: achieves state-of-the-art results in practice.

5: However, the performance of CFR is not fully understood, since the regret observed empirically is much lower than the known upper bound in \cite{zinkevich2008regret}. Moreover, CFR has to traverse the whole game tree in each round, which is time-consuming in large-scale games.%\dong{which is unnecessary or very time consuming? not tolerable seems vague}

6: In this paper, we present a novel technique, lazy update, which avoids traversing the whole game tree in each round. We propose a novel analysis of the regret of CFR with lazy update, which also applies to the vanilla CFR and yields a much tighter regret bound than that in \cite{zinkevich2008regret}. Inspired by lazy update, we further present a novel CFR variant, named Lazy-CFR. Whereas the vanilla CFR traverses $O(|\mathcal{I}|)$ information sets per round, Lazy-CFR needs to traverse only $O(\sqrt{|\mathcal{I}|})$ information sets per round while keeping the regret bound almost the same, where $\mathcal{I}$ is the set of all information sets. As a result, Lazy-CFR achieves better convergence than the vanilla CFR, and experimental results consistently show that Lazy-CFR significantly outperforms the vanilla CFR.

7: \end{abstract}

8: