1: \begin{abstract}
2: Beliefs inform the behavior of forward-thinking agents in complex environments.
3: Recently, sequential Bayesian inference has emerged as a mechanism to study belief formation among agents adapting to dynamical conditions.
4: However, we lack critical theory to explain how preferences evolve in cases of simple agent interactions.
5: In this paper, we derive a Gaussian, pairwise agent interaction model to study how preferences converge when driven by observation of each other's behaviors.
6: We show that the dynamics of convergence resemble an Ornstein-Uhlenbeck process, a common model in nonequilibrium stochastic dynamics.
7: Using standard analytical and computational techniques,
8: we find that the hyperprior magnitudes, representing the learning time, determine the convergence value and the asymptotic entropy of the preferences across pairs of agents.
9: We also show that the dynamical variance in preferences is characterized by a relaxation time $t^\star$, and compute its asymptotic upper bound.
10: This formulation enhances the existing toolkit for modeling stochastic, interactive agents by formalizing leading theories in learning theory, and builds towards more comprehensive models of open problems in principal-agent and market theory.
11: \end{abstract}
12: