607b056ddd42260f.tex
1: \begin{abstract}
2: In this work, we propose a novel adaptive stochastic gradient-free (ASGF) approach for solving high-dimensional nonconvex optimization problems based on function evaluations.
3: We employ a directional Gaussian smoothing of the target function that generates a surrogate of the gradient and assists in avoiding bad local optima by utilizing nonlocal information of the loss landscape.
4: Applying a deterministic quadrature scheme results in a massively scalable technique that is sample-efficient and achieves spectral accuracy.
5: At each step we randomly generate the search directions while primarily following the surrogate of the smoothed gradient.
6: This enables exploitation of the gradient direction while maintaining sufficient space exploration, and accelerates convergence towards the global extrema.
7: In addition, we make use of a local approximation of the Lipschitz constant in order to adaptively adjust the values of all hyperparameters, thus removing the careful fine-tuning of current algorithms that is often necessary to be successful when applied to a large class of learning tasks.
8: As such, the ASGF strategy offers significant improvements when solving high-dimensional nonconvex optimization problems when compared to other gradient-free methods (including the so called "evolutionary strategies'') as well as iterative approaches that rely on the gradient information of the objective function.
9: We illustrate the improved performance of this method by providing several comparative numerical studies on benchmark global optimization problems and reinforcement learning tasks.
10: \end{abstract}
11: