abstract:630c9b5ad848e35e.tex

1: \begin{abstract}

2: This paper presents \alg (Accelerated Graduated Generalized LInear-model Optimization), a stage-wise, graduated optimization technique that offers global convergence guarantees for non-convex optimization problems whose objectives are only locally convex and may fail to be even quasi-convex at a global scale. In particular, this includes learning problems that use popular activation functions such as sigmoid, softplus, and SiLU, which yield non-convex training objectives. \alg can be readily implemented using point as well as mini-batch SGD updates and offers provable convergence to the global optimum under general conditions. In experiments, \alg outperformed several recently proposed optimization techniques for non-convex and locally convex objectives in terms of convergence rate as well as accuracy at convergence. \alg relies on a graduation technique for generalized linear models, as well as a novel proof strategy, both of which may be of independent interest. Code for \alg is available at the following \codecite.

3: \end{abstract}

4: