\begin{abstract}
    Counterfactual regret minimization (CFR) is a family of algorithms for effectively solving imperfect-information games.
    It decomposes the total regret into counterfactual regrets and minimizes them with local regret minimization algorithms such as Regret Matching (RM) or RM+.
    Recent research establishes a connection between Online Mirror Descent (OMD) and RM+, paving the way for an optimistic variant, PRM+, and its extension, PCFR+.
    However, PCFR+ assigns uniform weights to all iterations when determining regrets, leading to substantial regrets when facing dominated actions.
    This work explores minimizing weighted counterfactual regret with optimistic OMD, resulting in a novel CFR variant, PDCFR+.
    It integrates PCFR+ and Discounted CFR (DCFR) in a principled manner, swiftly mitigating the negative effects of dominated actions and consistently leveraging predictions to accelerate convergence.
    Theoretical analyses prove that PDCFR+ converges to a Nash equilibrium, particularly under distinct weighting schemes for regrets and average strategies.
    Experimental results demonstrate PDCFR+'s fast convergence in common imperfect-information games.
    The code is available at \url{https://github.com/rpSebastian/PDCFRPlus}.
\end{abstract}