\begin{abstract}
% Semantic communication (SC) employs either task-specific languages or general human language.
% can be enabled using either a language learned for specific tasks or task-agnostic human language.
In this work, we compare emergent communication (EC) built upon multi-agent deep reinforcement learning (MADRL) with language-oriented semantic communication (LSC), which uses human language and is empowered by a pre-trained large language model (LLM). In a multi-agent remote navigation task with multimodal input data comprising location and channel maps, we show that EC incurs a high training cost and struggles with multimodal data, whereas LSC incurs a high inference computing cost due to the LLM's large size. To address their respective bottlenecks, we propose a novel framework, language-guided EC (LEC), which guides EC training using LSC via knowledge distillation (KD). Simulations corroborate that LEC achieves shorter travel times while avoiding areas with poor channel conditions, and speeds up MADRL training convergence by up to 61.8\% compared to EC.
% We address a multi-user-multi-modal navigation problem where multiple users learn navigation policies to reach their destinations, while considering environmental conditions and transmit power constraints. We propose three distinct methods: Emergent Communication (EC), Language-oriented Semantic Communication (LSC), and Language-aided Emergent Communication (LEC). EC operates within a multi-agent deep reinforcement learning (MADRL) framework, leveraging message exchanges between agents to facilitate decision-making. LSC employs Large Language Models (LLMs) and image-to-text generative models to guide navigation through linguistic instructions. LEC, on the other hand, applies knowledge distillation from LSC into EC, to integrate the strengths of both EC and LSC.
\end{abstract}