\begin{abstract}
Large-scale games have become standard benchmarks for measuring progress in artificial intelligence (AI),
and AI now frequently outperforms humans in them.
In games with incomplete information, this success has largely relied on computing a Nash equilibrium (NE) with variants of counterfactual regret minimization (CFR).
However, previous research has largely overlooked the multiplicity of Nash equilibria, which limits how well AI can adapt to diverse human preferences.
To address this gap, in which AI is strong but hard to customize, we introduce Preference-CFR, which incorporates two new parameters: preference degree and vulnerability degree.
These parameters allow greater flexibility in AI strategy development without compromising convergence.
Our method significantly alters the distribution of final strategies, enabling the creation of customized AI models that better align with individual user needs.
Using Texas Hold’em as a case study, our experiments demonstrate how Preference-CFR can be adjusted either to emphasize customization, prioritizing user preferences,
or to enhance performance, striking a balance between the depth of customization and strategic optimality.
\end{abstract}
