\begin{abstract}
  Finding a Nash Equilibrium (NE) in extensive-form games with incomplete information is a challenging problem at the intersection of machine learning and game theory, and it is crucial for AI systems that support decision-making across varied scenarios.
  Traditional Counterfactual Regret Minimization (CFR) techniques are effective at converging toward NE, but they focus on scenarios in which opponents play optimal strategies.
  However, machine learning for strategic game play should do more than react to optimal moves; it should aid human decision-making in all circumstances.
  This includes not only responding to optimal strategies but also recovering from suboptimal decisions and capitalizing on opponents' errors.
  This motivates the transition from NE to Bayesian Perfect Equilibrium (BPE), which accounts for every possible condition, including the irrationality of opponents.

  To bridge this gap, we propose Belief Update Fictitious Play (BUFP), which combines fictitious play with beliefs to target BPE, a more comprehensive equilibrium concept than NE.
  Specifically, adjusting the iteration step size allows BUFP to converge to both NE and BPE.
  For instance, in our experiments, BUFP(EF) uses the step size of Extensive Form Fictitious Play (EFFP) to achieve BPE, outperforming traditional CFR with a 48.53\% increase in benefits in scenarios characterized by dominated strategies.
\end{abstract}