c4ecaf3975f5b82e.tex
1: \begin{abstract}
2: The rise of mobile devices with abundant sensory data and local computing capabilities has driven the trend of federated learning (FL) on these devices. 
3: And personalized FL (PFL) emerges to train specific deep models for each mobile device to address data heterogeneity and varying performance preferences.
4: However, mobile training times vary significantly, resulting in either delay (when waiting for slower devices for aggregation) or accuracy decline (when aggregation proceeds without waiting).
5: In response, we propose a shift towards asynchronous PFL, where the server aggregates updates as soon as they are available.
6: Nevertheless, existing asynchronous protocols are unfit for PFL because they are devised for federated training of a single global model. 
7: They suffer from slow convergence and decreased accuracy when confronted with severe data heterogeneity prevalent in PFL. 
8: Furthermore, they often exclude slower devices for staleness control, which notably compromises accuracy when these devices possess critical personalized data.
9: %
10: Therefore, we propose \sysname, a coordination mechanism for asynchronous PFL. 
11: Central to \sysname is to include updates from all mobile devices regardless of their latency. 
12: To cope with the inevitable staleness from slow devices, \sysname revisits model broadcasting. 
13: It intelligently converts the unscalable broadcast to \textit{on-demand broadcast}, leveraging the \textit{asymmetrical bandwidth} in wireless networks and the dynamic clustering-based PFL. 
14: Experiments show that compared to status quo approaches, \sysname achieves a reduction of up to 88.2\% in convergence time, an improvement of up to 46\% in accuracy, and a decrease of 37\% in communication costs.
15: \end{abstract}
16: