abstract:951a38109cdd4291.tex

1: \begin{abstract}%   <- trailing '%' for backward compatibility of .sty file

2: In this paper, we propose a general Riemannian proximal optimization

3: algorithm with guaranteed convergence to solve Markov decision process

4: (MDP) problems. To model policy functions in an MDP, we employ a Gaussian

5: mixture model (GMM) and formulate policy optimization as a non-convex optimization

6: problem in the Riemannian space of positive semidefinite matrices.

7: For two given policy functions, we also provide a lower bound on

8: policy improvement by using bounds derived from the Wasserstein distance

9: of GMMs. Preliminary experiments show the efficacy of our proposed

10: Riemannian proximal policy optimization algorithm.

11: \end{abstract}

12: