2501a2d902c65e23.tex
1: \begin{abstract}
2: Semi-grant-free non-orthogonal multiple access (semi-GF NOMA) has emerged as a promising technology for fifth-generation new radio (5G-NR) networks that support the coexistence of a large number of random connections with diverse quality-of-service requirements.
3: However, implementing a semi-GF NOMA mechanism in 5G-NR networks with heterogeneous services raises several resource management problems related to the unpredictable interference caused by the GF access strategy.
4: To cope with this challenge, this paper develops a novel hybrid optimization and multi-agent deep (HOMAD) reinforcement learning-based resource allocation design to maximize the energy efficiency (EE) of semi-GF NOMA 5G-NR systems.
5: In this design, a multi-agent deep Q network (MADQN) approach is employed to conduct the subchannel assignment (SA) among users,
6: while optimization-based methods are utilized to optimize the transmission power for every SA setting.
7: In addition, a full MADQN scheme conducting both SA and power allocation is also considered for comparison purposes. 
8: Simulation results show that the HOMAD approach significantly outperforms the benchmark schemes in terms of convergence time and average EE.
9: % This indicates that the combination of MADQN and convex optimization is a promising solution for SGF-NOMA systems that require the resource block and power allocation strategies to be implemented in a timely and accurate manner over time-varying wireless medium.
10: 
11: \end{abstract}
12: