1: \begin{abstract}
2: We~consider the automatic online synthesis of black-box test cases from functional requirements specified as automata for reactive implementations.
3: The goal of the tester is to reach some given state, so as to satisfy a coverage criterion,
4: while monitoring the violation of the requirements.
5: We develop an approach based on Monte Carlo Tree Search, which is a classical
6: technique in reinforcement learning for efficiently selecting promising inputs.
7: Seeing the automata requirements as a game between the implementation and the tester,
8: we develop a heuristic by biasing the search towards inputs that are promising in this game.
9: We experimentally show that our heuristic accelerates the convergence of the Monte Carlo Tree Search algorithm,
10: thus improving the performance of testing.
11: \end{abstract}
12: