\begin{abstract}

    In recent years, over-the-air aggregation has been widely studied for large-scale distributed learning, optimization, and sensing.

    In this paper, we propose an over-the-air federated policy gradient algorithm, in which all agents simultaneously transmit analog signals carrying their local information over a shared wireless channel, and a central controller uses the received aggregated waveform to update the policy parameters.

    We investigate the effect of noise and channel distortion on the convergence of the proposed algorithm, and establish the communication and sampling complexities for finding an $\epsilon$-approximate stationary point. Finally, we present simulation results that illustrate the effectiveness of the algorithm.

\end{abstract}
