8d6c0ac55fdc61cf.tex
1: \begin{abstract}
2:     Counterfactual regret minimization (CFR) is a family of algorithms for effectively solving imperfect-information games.
3:     It decomposes the total regret into counterfactual regrets, utilizing local regret minimization algorithms, such as Regret Matching (RM) or RM+, to minimize them.
4:     Recent research establishes a connection between Online Mirror Descent (OMD) and RM+, paving the way for an optimistic variant PRM+ and its extension PCFR+.
5:     However, PCFR+ assigns uniform weights for each iteration when determining regrets, leading to substantial regrets when facing dominated actions.
6:     This work explores minimizing weighted counterfactual regret with optimistic OMD, resulting in a novel CFR variant PDCFR+.
7:     It integrates PCFR+ and Discounted CFR (DCFR) in a principled manner, swiftly mitigating negative effects of dominated actions and consistently leveraging predictions to accelerate convergence.
8:     Theoretical analyses prove that PDCFR+ converges to a Nash equilibrium, particularly under distinct weighting schemes for regrets and average strategies.
9:     Experimental results demonstrate PDCFR+'s fast convergence in common imperfect-information games.
10:     The code is available at \url{https://github.com/rpSebastian/PDCFRPlus}.
11: \end{abstract}
12: