1: \begin{abstract}
2: Non-linear state-space models, also known as general hidden Markov models, are ubiquitous in statistical machine learning, being the most classical generative models for serial data and sequences in general.
3: The particle-based, rapid incremental smoother (\PARIS) is a sequential Monte Carlo (SMC) technique allowing for efficient online approximation of expectations of additive functionals under the smoothing distribution in these models.
4: Such expectations appear naturally in several learning contexts, such as likelihood estimation (MLE) and Markov score climbing (MSC). {\PARIS} has linear computational complexity, limited memory requirements and comes with non-asymptotic bounds, convergence results and stability guarantees.
5: Still, being based on self-normalised importance sampling, the {\PARIS} estimator is biased.
6: Our first contribution is to design a novel additive smoothing algorithm, the Parisian particle Gibbs (\PPG) sampler, which can be viewed as a {\PARIS} algorithm driven by conditional SMC moves, resulting in bias-reduced estimates of the targeted quantities. We substantiate the {\PPG} algorithm with theoretical results, including new bounds on bias and variance as well as deviation inequalities.
7: Our second contribution is to apply {\PPG} in a learning framework, covering MLE and MSC as special examples. In this context, we establish, under standard assumptions, non-asymptotic bounds highlighting the value of bias reduction and the implicit Rao--Blackwellization of {\PPG}. These are the first non-asymptotic results of this kind in this setting.
8: We illustrate our theoretical results with numerical experiments supporting our claims.
9: \end{abstract}
10: