a1cf0f741eb4b92d.tex
1: \begin{abstract}
2:         Recent advancements in artificial intelligence (AI) have leveraged large-scale games as benchmarks to gauge progress,
3:         with AI now frequently surpassing human performance.
4:         Traditionally, this success has relied largely on computing a Nash equilibrium (NE) with variants of the counterfactual regret minimization (CFR) method in games with incomplete information.
5:         However, the multiplicity of Nash equilibria has been largely overlooked in previous research, which limits how well AI can adapt to diverse human preferences.
6:         To address this gap between strong play and limited customizability, we introduce Preference-CFR, a novel approach that incorporates two new parameters: preference degree and vulnerability degree.
7:         These parameters allow for greater flexibility in AI strategy development without compromising convergence.
8:         Our method significantly alters the distribution of final strategies, enabling the creation of customized AI models that better align with individual user needs.
9:         Using Texas Hold’em as a case study, our experiments demonstrate how Preference-CFR can be adjusted either to emphasize customization, prioritizing user preferences,
10:         or to enhance performance, striking a balance between the depth of customization and strategic optimality.
11:     \end{abstract}
12: