EconPapers    
Economics at your fingertips  
 

A Survey on Population-Based Deep Reinforcement Learning

Weifan Long, Taixian Hou, Xiaoyi Wei, Shichao Yan, Peng Zhai () and Lihua Zhang ()
Additional contact information
Weifan Long: Academy for Engineering and Technology, Fudan University, Shanghai 200433, China
Taixian Hou: Academy for Engineering and Technology, Fudan University, Shanghai 200433, China
Xiaoyi Wei: Academy for Engineering and Technology, Fudan University, Shanghai 200433, China
Shichao Yan: Academy for Engineering and Technology, Fudan University, Shanghai 200433, China
Peng Zhai: Academy for Engineering and Technology, Fudan University, Shanghai 200433, China
Lihua Zhang: Academy for Engineering and Technology, Fudan University, Shanghai 200433, China

Mathematics, 2023, vol. 11, issue 10, 1-17

Abstract: Many real-world applications can be described as large-scale games of imperfect information, which require extensive prior domain knowledge, especially in competitive or human–AI cooperation settings. Population-based training methods have become a popular solution to learn robust policies without any prior knowledge, which can generalize to policies of other players or humans. In this survey, we shed light on population-based deep reinforcement learning (PB-DRL) algorithms, their applications, and general frameworks. We introduce several independent subject areas, including naive self-play, fictitious self-play, population-play, evolution-based training methods, and the policy-space response oracle family. These methods provide a variety of approaches to solving multi-agent problems and are useful in designing robust multi-agent reinforcement learning algorithms that can handle complex real-life situations. Finally, we discuss challenges and hot topics in PB-DRL algorithms. We hope that this brief survey can provide guidance and insights for researchers interested in PB-DRL algorithms.

Keywords: reinforcement learning; multi-agent reinforcement learning; self play; population play (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/11/10/2234/pdf (application/pdf)
https://www.mdpi.com/2227-7390/11/10/2234/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:11:y:2023:i:10:p:2234-:d:1143662

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jmathe:v:11:y:2023:i:10:p:2234-:d:1143662