\begin{abstract}
In this paper, we propose a novel accelerated gradient method called \anita for solving fundamental finite-sum optimization problems.
Concretely, we consider both general convex and strongly convex settings:
i) For general convex finite-sum problems, \anita improves on the previous state-of-the-art result given by Varag \citep{lan2019unified}.
In particular, for large-scale problems or when the target error is not very small, i.e., $n \geq \frac{1}{\epsilon^2}$, \anita obtains the \emph{first} optimal result $O(n)$, matching the lower bound $\Omega(n)$ provided by \citet{woodworth2016tight}, whereas the previous results are $O(n \log \frac{1}{\epsilon})$ for Varag \citep{lan2019unified} and $O(\frac{n}{\sqrt{\epsilon}})$ for Katyusha \citep{allen2017katyusha}.
ii) For strongly convex finite-sum problems, we also show that \anita achieves the optimal convergence rate $O\big((n+\sqrt{\frac{nL}{\mu}})\log\frac{1}{\epsilon}\big)$, matching the lower bound $\Omega\big((n+\sqrt{\frac{nL}{\mu}})\log\frac{1}{\epsilon}\big)$ provided by \citet{lan2015optimal}.
In addition, \anita has a simpler loopless algorithmic structure, unlike previous accelerated algorithms such as Varag \citep{lan2019unified} and Katyusha \citep{allen2017katyusha}, which use double-loop structures. Moreover, we provide a novel \emph{dynamic multi-stage convergence analysis}, which is the key technical ingredient for improving previous results to the optimal rates.
We believe that our new theoretical rates and novel convergence analysis for the fundamental finite-sum problem will directly lead to key improvements for many other related problems, such as distributed/federated/decentralized optimization problems (e.g., \citealp{li2021canita}).
Finally, numerical experiments show that \anita converges faster than the previous state-of-the-art Varag \citep{lan2019unified}, validating our theoretical results and confirming the practical superiority of \anita.
\end{abstract}