\begin{abstract}
Large Language Models (LLMs) are transforming the landscape of mobile intelligence. Federated Learning (FL), a method that preserves user data privacy, is often employed to fine-tune LLMs for downstream mobile tasks, an approach known as FedLLM.
%Though recent efforts have addressed the network issue induced by the vast model size, they have not practically mitigated vital challenges concerning integration with mobile devices, such as significant memory consumption and sluggish model convergence.
A vital challenge of FedLLM is the tension between LLM complexity and the resource constraints of mobile devices.

In response to this challenge, this work introduces \sys\footnote{\sys can be accessed at \url{https://github.com/UbiquitousLearning/FwdLLM.git}}, an FL protocol designed to improve FedLLM efficiency.
The key idea of \sys is to employ backpropagation (BP)-free training methods, requiring devices only to execute ``perturbed inferences''. Consequently, \sys delivers substantially better memory efficiency and time efficiency (the latter expedited by mobile NPUs and an expanded array of participant devices).
\sys centers around three key designs:
(1) it combines BP-free training with parameter-efficient training methods, which is essential for scaling the approach to the LLM era;
(2) it systematically and adaptively allocates computational loads across devices, striking a careful balance between convergence speed and accuracy;
(3) it discriminatively samples perturbed predictions that are more valuable to model convergence.
Comprehensive experiments demonstrate \sys's significant advantages over conventional methods, including up to three orders of magnitude faster convergence and a 14.6$\times$ reduction in memory footprint. Uniquely, \sys paves the way for federated learning of billion-parameter LLMs such as LLaMA on commercial off-the-shelf (COTS) mobile devices, a feat previously unattained.

\end{abstract}