This paper addresses the problem of joint multiplexing of enhanced Mobile Broadband (eMBB) and massive Machine-Type Communications (mMTC) traffic in the same uplink time- frequency RG. Given the challenge posed by a potentially large number of users, it is essential to focus on a multiple access strategy that leverages artificial intelligence to adapt to specific channel conditions. An mMTC agent is developed through a Deep Reinforcement Learning (DRL) methodology for generating grant-free frequency hopping traffic in a decentralized manner, assuming the presence of underlying eMBB traffic dynamics. Within this DRL framework, a methodical comparison between two possible deep neural networks is conducted, using different generative models employed to ascertain their intrinsic capabilities in various application scenarios. The analysis conducted reveals that the Long Short-Term Memory network is particularly suitable for the required task, demonstrating a robustness that is consistently very close to potential upper-bounds, despite the latter requiring complete knowledge of the underlying statistics.

Decentralized Grant-Free mMTC Traffic Multiplexing with eMBB Data through Deep Reinforcement Learning

Giovanni Di Gennaro
;
Amedeo Buonanno;Gianmarco Romano;Francesco Palmieri
2024

Abstract

This paper addresses the problem of joint multiplexing of enhanced Mobile Broadband (eMBB) and massive Machine-Type Communications (mMTC) traffic in the same uplink time- frequency RG. Given the challenge posed by a potentially large number of users, it is essential to focus on a multiple access strategy that leverages artificial intelligence to adapt to specific channel conditions. An mMTC agent is developed through a Deep Reinforcement Learning (DRL) methodology for generating grant-free frequency hopping traffic in a decentralized manner, assuming the presence of underlying eMBB traffic dynamics. Within this DRL framework, a methodical comparison between two possible deep neural networks is conducted, using different generative models employed to ascertain their intrinsic capabilities in various application scenarios. The analysis conducted reveals that the Long Short-Term Memory network is particularly suitable for the required task, demonstrating a robustness that is consistently very close to potential upper-bounds, despite the latter requiring complete knowledge of the underlying statistics.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11591/538808
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact