214933d1e65bb668.tex
1: \begin{abstract}
2: Deep unfolding is a promising deep-learning technique, whose network architecture is based on expanding the recursive structure of {existing iterative algorithms.} 
3: Although convergence acceleration is a remarkable advantage of deep unfolding, its theoretical aspects have not been revealed yet.
4: The first half of this study details the theoretical analysis of the convergence acceleration in deep-unfolded gradient descent (DUGD) whose trainable parameters are step sizes.
5: We propose a plausible interpretation of the learned step-size parameters in DUGD by introducing the principle of Chebyshev steps derived from Chebyshev polynomials.  
6: The use of Chebyshev steps in gradient descent (GD) enables us to bound the spectral radius 
7: of a matrix governing the convergence speed of GD, leading to a tight upper bound on the convergence rate.
8: The convergence rate of GD using Chebyshev steps is shown to be asymptotically 
9: optimal, although it has no momentum terms. 
10: We also show that Chebyshev steps numerically explain
11:  the learned step-size parameters in DUGD well.
12: In the second half of the study, %we apply the theory of Chebyshev steps {and}
13: %developed in the first half to fixed-point iterations.
14:  Chebyshev-periodical successive over-relaxation (Chebyshev-PSOR) is proposed 
15: for accelerating linear/nonlinear fixed-point iterations.
16: Theoretical analysis 
17: % using linear approximation {around a fixed point 
18: %reveals the local convergence behavior.}
19: {and numerical} %Numerical 
20: experiments indicate that Chebyshev-PSOR exhibits significantly faster convergence for various examples such as Jacobi method and proximal gradient methods. %, and Landweber iteration. 
21: \end{abstract}
22: