Machine Learning-Based Resource Allocation Algorithm to Mitigate Interference in D2D-Enabled Cellular Networks
Md Kamruzzaman,
Nurul I. Sarkar () and
Jairo Gutierrez
Additional contact information
Md Kamruzzaman: Computer Science and Software Engineering, Auckland University of Technology, Auckland 1010, New Zealand
Nurul I. Sarkar: Computer Science and Software Engineering, Auckland University of Technology, Auckland 1010, New Zealand
Jairo Gutierrez: Computer Science and Software Engineering, Auckland University of Technology, Auckland 1010, New Zealand
Future Internet, 2024, vol. 16, issue 11, 1-26
Abstract:
Mobile communications have experienced exponential growth both in connectivity and multimedia traffic in recent years. To support this tremendous growth, device-to-device (D2D) communications play a significant role in 5G and beyond 5G networks. However, enabling D2D communications in an underlay, heterogeneous cellular network poses two major challenges. First, interference management between D2D and cellular users directly affects a system’s performance. Second, achieving an acceptable level of link quality for both D2D and cellular networks is necessary. An optimum resource allocation is required to mitigate the interference and improve a system’s performance. In this paper, we provide a solution to interference management with an acceptable quality of services (QoS). To this end, we propose a machine learning-based resource allocation method to maximize throughput and achieve minimum QoS requirements for all active D2D pairs and cellular users. We first solve a resource optimization problem by allocating spectrum resources and controlling power transmission on demand. As resource optimization is an integer nonlinear programming problem, we address this problem by proposing a deep Q-network-based reinforcement learning algorithm (DRL) to optimize the resource allocation issue. The proposed DRL algorithm is trained with a decision-making policy to obtain the best solution in terms of spectrum efficiency, computational time, and throughput. The system performance is validated by simulation. The results show that the proposed method outperforms the existing ones.
Keywords: machine learning (ML); device-to-device (D2D); resource allocation (RA); reinforcement learning (RL); Markov Decision Process (MDP); deep reinforcement learning (DRL); ultra-dense network (UDN); heterogeneous networks (HetNets) (search for similar items in EconPapers)
JEL-codes: O3 (search for similar items in EconPapers)
Date: 2024
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/1999-5903/16/11/408/pdf (application/pdf)
https://www.mdpi.com/1999-5903/16/11/408/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jftint:v:16:y:2024:i:11:p:408-:d:1514963
Access Statistics for this article
Future Internet is currently edited by Ms. Grace You
More articles in Future Internet from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().