abstract:a1c05cd05cdefeb5.tex

1: \begin{abstract}

2: Stochastic gradient descent based algorithms are typically used as general-purpose optimization tools for most deep learning models.

3: A Restricted Boltzmann Machine (RBM) is a probabilistic generative model that can be stacked to construct deep architectures.

4: For RBMs with Bernoulli inputs, non-Euclidean algorithms such as stochastic spectral descent (SSD) have been specifically designed to speed up convergence by making better use of the gradient estimates obtained by sampling methods.

5: However, the existing algorithm and its theoretical justification depend on the assumption that the set of possible input configurations is finite, as with binary variables.

6: The purpose of this paper is to generalize SSD to the Gaussian RBM, which can model continuous data, without relying on the assumption of finitely many input configurations.

7: We propose gradient descent methods in a non-Euclidean parameter space by deriving upper bounds on the logarithmic partition function of RBMs based on the Schatten-$\infty$ norm.

8: We empirically show the advantage of SSD over stochastic gradient descent (SGD).

9: \end{abstract}

10: