1: \begin{abstract}
2: Algorithms based on stochastic gradient descent are typically used as general-purpose optimization tools for most deep learning models.
3: A Restricted Boltzmann Machine (RBM) is a probabilistic generative model that can be stacked to construct deep architectures.
4: For RBMs with Bernoulli inputs, non-Euclidean algorithms such as stochastic spectral descent (SSD) have been specifically designed to speed up convergence by making better use of the gradient estimates obtained from sampling methods.
5: However, the existing algorithm and its theoretical justification depend on the assumption that the set of possible input configurations is finite, as is the case for binary variables.
6: The purpose of this paper is to generalize SSD to Gaussian RBMs, which are capable of modeling continuous data, without relying on that assumption.
7: We propose gradient descent methods in a non-Euclidean space of parameters by deriving upper bounds on the logarithmic partition function of RBMs based on the Schatten-$\infty$ norm.
8: We empirically show the advantage and improvement of SSD over stochastic gradient descent (SGD).
9: \end{abstract}
10: