Harnessing Online Knowledge Transfer for Enhanced Search and Rescue Decisions via Multi-Agent Reinforcement Learning

Song, Luona; Wen, Zhigang; Teng, Junjie; Zhang, Jian; Nicolas, Merveille

Luona Song, Zhigang Wen, Junjie Teng, Jian Zhang and Merveille Nicolas
Additional contact information
Luona Song: School of Economics and Management, Beijing Information Science and Technology University, Beijing 100192, China
Zhigang Wen: School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China
Junjie Teng: School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China
Jian Zhang: School of Economics and Management, Beijing Information Science and Technology University, Beijing 100192, China
Merveille Nicolas: Department of Strategy and Social and Environmental Responsibility, Université du Québec à Montréal, Montréal, QC H3C 3P8, Canada

Sustainability, 2023, vol. 15, issue 24, 1-18

Abstract: In the rapidly evolving domain of the Internet of Things (IoT), devices play an instrumental role in high-stakes scenarios such as search and rescue (SAR) operations. Traditional decision-making processes in SAR missions often struggle to cope with the dynamic and unpredictable nature of such environments, leading to inefficiencies and delayed responses. This paper explores the potential of multi-agent reinforcement learning (MARL) to improve decision making in IoT-supported SAR operations. Current methods are limited by their static decision frameworks and their inability to adapt in real time to the chaotic variables present in SAR situations. We introduce a novel MARL framework and compare its performance against benchmark strategies, specifically the multi-agent deep deterministic policy gradient (MADDPG) approach. The framework builds on the deep deterministic policy gradient (DDPG) method and is enhanced by online knowledge transfer. The preliminary findings indicate that the proposed framework is more efficient and faster than the benchmark in SAR contexts. Our research highlights MARL’s potential as a strategy for IoT-based decision making in high-pressure SAR environments, and we offer suggestions for further studies in varied real-world scenarios.

Keywords: search and rescue (SAR); Internet of Things (IoT); deep deterministic policy gradient (DDPG); online knowledge transfer; soft target generation technique; cooperative games; competitive games
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56
Date: 2023

Downloads:
https://www.mdpi.com/2071-1050/15/24/16741/pdf (application/pdf)
https://www.mdpi.com/2071-1050/15/24/16741/ (text/html)

Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:15:y:2023:i:24:p:16741-:d:1298114

Sustainability is currently edited by Ms. Alexandra Wu

Bibliographic data for series maintained by MDPI Indexing Manager.