\begin{abstract}
Over-the-air computation is a communication-efficient solution for federated learning (FL). Such a system follows an iterative procedure: every mobile device updates the local gradient of its private loss function, amplifies it, and transmits it; the server receives the aggregated gradient all at once, then generates and broadcasts the updated model parameters to every mobile device.
{
When selecting the amplification factor, most related works assume that the local gradient always attains its maximal norm, although the norm actually fluctuates over iterations; this assumption may degrade convergence performance.}
To circumvent this problem, we propose normalizing the local gradient before amplifying it.
Under the proposed method, when the loss function is smooth, we prove convergence to a stationary point at a sub-linear rate.
When the loss function is smooth and strongly convex, we prove that the proposed method attains the minimal training loss at a linear rate, up to an arbitrarily small positive tolerance. Moreover, we uncover a tradeoff between the convergence rate and the tolerance.
To speed up convergence, we also formulate problems that optimize the system parameters for the above two cases.
Although the formulated problems are non-convex, we derive their optimal solutions with polynomial complexity.
Experimental results show that the proposed method outperforms benchmark methods in convergence performance.
\end{abstract}