\begin{abstract}
        Recent advancements in artificial intelligence (AI) have leveraged large-scale games as benchmarks to gauge progress,
        with AI now frequently surpassing human performance.
        Traditionally, this success has largely relied on computing a Nash equilibrium (NE) with variants of the counterfactual regret minimization (CFR) method in games with incomplete information.
        However, the variety of Nash equilibria has been largely overlooked in previous research, which limits the ability of AI to adapt to diverse human preferences.
        To address this challenge, where AI is powerful but struggles to meet customization needs, we introduce Preference-CFR, a novel approach that incorporates two new parameters, preference degree and vulnerability degree.
        These parameters allow for greater flexibility in AI strategy development without compromising convergence.
        Our method significantly alters the distribution of final strategies, enabling the creation of customized AI models that better align with individual user needs.
        Using Texas Hold’em as a case study, our experiments demonstrate how Preference-CFR can be adjusted either to emphasize customization, prioritizing user preferences,
        or to enhance performance, striking a balance between the depth of customization and strategic optimality.
    \end{abstract}