abstract:214933d1e65bb668.tex

1: \begin{abstract}

2: Deep unfolding is a promising deep-learning technique whose network architecture is obtained by unrolling the recursive structure of existing iterative algorithms.

3: Although convergence acceleration is a remarkable advantage of deep unfolding, its theoretical aspects have not yet been clarified.

4: The first half of this study presents a theoretical analysis of convergence acceleration in deep-unfolded gradient descent (DUGD), whose trainable parameters are step sizes.

5: We propose a plausible interpretation of the learned step-size parameters in DUGD by introducing the principle of Chebyshev steps derived from Chebyshev polynomials.

6: The use of Chebyshev steps in gradient descent (GD) enables us to bound the spectral radius

7: of a matrix governing the convergence speed of GD, leading to a tight upper bound on the convergence rate.

8: The convergence rate of GD using Chebyshev steps is shown to be asymptotically

9: optimal, even though the method uses no momentum terms.

10: We also show numerically that Chebyshev steps account well for

11:  the learned step-size parameters in DUGD.

12: In the second half of the study, %we apply the theory of Chebyshev steps {and}

13: %developed in the first half to fixed-point iterations.

14:  Chebyshev-periodical successive over-relaxation (Chebyshev-PSOR) is proposed

15: for accelerating linear/nonlinear fixed-point iterations.

16: Theoretical analysis

17: % using linear approximation {around a fixed point

18: %reveals the local convergence behavior.}

19: {and numerical} %Numerical

20: experiments indicate that Chebyshev-PSOR exhibits significantly faster convergence for various examples such as the Jacobi method and proximal gradient methods. %, and Landweber iteration.

21: \end{abstract}
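
For orientation, the Chebyshev steps mentioned in the abstract can be sketched as follows. The notation is assumed here for illustration and is not fixed by the abstract: $A$ is a Hermitian positive definite matrix with smallest and largest eigenvalues $\lambda_{\min}$ and $\lambda_{\max}$ (assumed distinct), $T$ is the period of the step-size sequence, and GD is taken to minimize a quadratic objective whose Hessian is $A$. A standard construction takes the step sizes as reciprocals of the Chebyshev nodes mapped to $[\lambda_{\min}, \lambda_{\max}]$:
\[
  \gamma_t = \left[ \frac{\lambda_{\max}+\lambda_{\min}}{2}
    + \frac{\lambda_{\max}-\lambda_{\min}}{2}
      \cos\left( \frac{(2t+1)\pi}{2T} \right) \right]^{-1},
  \qquad t = 0, 1, \ldots, T-1 .
\]
Under these assumptions, the error after $T$ steps is multiplied by $\prod_{t=0}^{T-1} (I - \gamma_t A)$, and the minimax property of the degree-$T$ Chebyshev polynomial $C_T$ gives
\[
  \max_{\lambda \in [\lambda_{\min}, \lambda_{\max}]}
    \left| \prod_{t=0}^{T-1} (1 - \gamma_t \lambda) \right|
  = \frac{1}{C_T\left( \dfrac{\lambda_{\max}+\lambda_{\min}}{\lambda_{\max}-\lambda_{\min}} \right)} ,
\]
which illustrates the kind of spectral-radius bound the abstract refers to.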

22: