1: \begin{abstract}% <- trailing '%' for backward compatibility of .sty file
2: In this paper, We propose a general Riemannian proximal optimization
3: algorithm with guaranteed convergence to solve Markov decision process
4: (MDP) problems. To model policy functions in MDP, we employ Gaussian
5: mixture model (GMM) and formulate it as a non-convex optimization
6: problem in the Riemannian space of positive semidefinite matrices.
7: For two given policy functions, we also provide its lower bound on
8: policy improvement by using bounds derived from the Wasserstein distance
9: of GMMs. Preliminary experiments show the efficacy of our proposed
10: Riemannian proximal policy optimization algorithm.
11: \end{abstract}
12: