\begin{abstract}
% In this paper, we consider the problem of sequential Bayesian experimental design for non-exchangeable data, often arising in the context of dynamical systems. Recent advances in this domain have brought forth several techniques that rely on biased estimators of the expected information gain (EIG) to amortize the cost of experiments by learning a design policy in advance. We propose a novel approach that formulates amortized sequential Bayesian design as risk-sensitive policy optimization with an inherent bias-variance trade-off mechanism and asymptotic convergence guarantees. To this end, we develop the Inside-Out SMC\textsuperscript{2} algorithm that uses a nested sequential Monte Carlo (SMC) estimator of the expected information gain and embeds it into a particle Markov chain Monte Carlo (pMCMC) framework to perform gradient-based policy optimization. Numerical validation on a set of dynamical systems showcases the efficacy of our method in comparison to other state-of-the-art strategies.
We propose a novel approach to Bayesian experimental design (BED) for non-exchangeable data that formulates it as risk-sensitive policy optimization.
We develop the Inside-Out SMC\textsuperscript{2} algorithm, which uses a nested sequential Monte Carlo (SMC) estimator of the expected information gain (EIG) and embeds it into a particle Markov chain Monte Carlo (pMCMC) framework to perform gradient-based policy optimization.
This contrasts with recent approaches that rely on biased estimators of the EIG to amortize the cost of experiments by learning a design policy in advance.
Numerical validation on a set of dynamical systems demonstrates the efficacy of our method in comparison to other state-of-the-art strategies.
\end{abstract}