abstract:82e23c064d3c2d31.tex

1: \begin{abstract}

2: We present a novel adaptive optimization algorithm for large-scale machine learning problems.

3: Equipped with a low-cost estimate of local curvature and Lipschitz smoothness, our method dynamically adapts the search direction and step-size.

4: The search direction contains gradient information preconditioned by a well-scaled diagonal preconditioning matrix that captures the local curvature information.

5: Our methodology does not require the tedious task of learning rate tuning, as the learning rate is updated automatically without adding an extra hyperparameter.

6: We provide convergence guarantees on a comprehensive collection of optimization problems, including convex, strongly convex, and nonconvex problems, in both deterministic and stochastic regimes.

7: We also conduct an extensive empirical evaluation on standard machine learning problems, justifying our algorithm's versatility and demonstrating its strong performance compared to other start-of-the-art first-order and second-order methods.

8: \end{abstract}

9: