abstract:aecc5d03d666ba22.tex

1: \begin{abstract} Convexity of the objective function often makes it possible to guarantee much better convergence rates for iterative minimization methods than in the general non-convex case. However, many problems encountered in training neural networks are non-convex. Some of them satisfy conditions that are weaker than convexity but still sufficient to guarantee the convergence of some first-order methods.

2:

3: In this work, we present a condition that replaces convexity and show that gradient descent with a fixed step length retains its convergence rate under this condition. We show that the sequential subspace optimization method is optimal in terms of oracle complexity in this case. We also provide a substitute for strong convexity that is sufficient to

4: guarantee the same convergence rate as in the strongly convex case for this new class of generally non-convex functions.

5: \keywords{Non-convex minimization\and First-order methods\and Accelerated methods\and Global optimization}

6: %\subclass{90C26}

7: \end{abstract}

8: