Proximal Policy Optimization for Efficient D2D-Assisted Computation Offloading and Resource Allocation in Multi-Access Edge Computing
Chen Zhang,
Celimuge Wu,
Min Lin,
Yangfei Lin and
William Liu ()
Additional contact information
Chen Zhang: College of Computer Science and Technology, Inner Mongolia Normal University, Saihan District, Hohhot 010096, China
Celimuge Wu: Graduate School of Informatics and Engineering, The University of Electro-Communications, Tokyo 1828585, Japan
Min Lin: College of Computer Science and Technology, Inner Mongolia Normal University, Saihan District, Hohhot 010096, China
Yangfei Lin: Graduate School of Informatics and Engineering, The University of Electro-Communications, Tokyo 1828585, Japan
William Liu: School of Computing, Electrical and Applied Technologies, Unitec Institute of Technology, Auckland 1025, New Zealand
Future Internet, 2024, vol. 16, issue 1, 1-17
Abstract:
In the advanced 5G and beyond networks, multi-access edge computing (MEC) is increasingly recognized as a promising technology, offering the dual advantages of reducing energy utilization in cloud data centers while catering to the demands for reliability and real-time responsiveness in end devices. However, the inherent complexity and variability of MEC networks pose significant challenges in computational offloading decisions. To tackle this problem, we propose a proximal policy optimization (PPO)-based Device-to-Device (D2D)-assisted computation offloading and resource allocation scheme. We construct a realistic MEC network environment and develop a Markov decision process (MDP) model that minimizes time loss and energy consumption. The integration of a D2D communication-based offloading framework allows for collaborative task offloading between end devices and MEC servers, enhancing both resource utilization and computational efficiency. The MDP model is solved using the PPO algorithm in deep reinforcement learning to derive an optimal policy for offloading and resource allocation. Extensive comparative analysis with three benchmarked approaches has confirmed our scheme’s superior performance in latency, energy consumption, and algorithmic convergence, demonstrating its potential to improve MEC network operations in the context of emerging 5G and beyond technologies.
Keywords: multi-access edge computing (MEC); 5G networks; Device-to-Device (D2D); proximal policy optimization (PPO); Markov decision process (MDP); computation offloading; collaborative offloading; resource allocation (search for similar items in EconPapers)
JEL-codes: O3 (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/1999-5903/16/1/19/pdf (application/pdf)
https://www.mdpi.com/1999-5903/16/1/19/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jftint:v:16:y:2024:i:1:p:19-:d:1311991
Access Statistics for this article
Future Internet is currently edited by Ms. Grace You
More articles in Future Internet from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().