ca3ac8f22c544c86.tex
1: \begin{abstract}
2: In this paper, we show how $K$-nearest neighbor ($K$-NN) resampling, an off-policy evaluation method proposed in \cite{giegrich2023k}, can be applied to simulate limit order book (LOB) markets and how it can be used to evaluate and calibrate trading strategies. Using historical LOB data, we demonstrate that our simulation method is capable of recreating realistic LOB dynamics and that synthetic trading within the simulation leads to a market impact in line with the corresponding literature. Compared to other statistical LOB simulation methods, our algorithm has theoretical convergence guarantees under general conditions, does not require optimization, is easy to implement and computationally efficient. Furthermore, we show that in a benchmark comparison our method outperforms a deep learning-based algorithm for several key statistics. In the context of a LOB with pro-rata type matching, we demonstrate how our algorithm can calibrate the size of limit orders for a liquidation strategy. Finally, we describe how $K$-NN resampling can be modified for choices of higher dimensional state spaces. 
3: %Central limit order books (LOBs) are the prevalent organisational mechanism for trading in a wide range of assets. Thus, finding good trading strategies in such environments is critical for market participants, while the testing of trading strategies in a live market  is risky and may lead to sizeable losses. Off-policy evaluation, a subfield of reinforcement learning, deals with the problem of evaluating strategies using observational data instead.
4: \end{abstract}
5: