EconPapers    
Economics at your fingertips  
 

Population-based exploration in reinforcement learning through repulsive reward shaping using eligibility traces

Melis Ilayda Bal (), Cem Iyigun (), Faruk Polat () and Huseyin Aydin ()
Additional contact information
Melis Ilayda Bal: Max Planck Institute for Software Systems
Cem Iyigun: Middle East Technical University
Faruk Polat: Middle East Technical University
Huseyin Aydin: Middle East Technical University

Annals of Operations Research, 2025, vol. 347, issue 2, No 11, 1059-1091

Abstract: Abstract Efficient exploration plays a key role in accelerating the learning performance and sample efficiency of reinforcement learning tasks. In this paper we propose a framework that serves as a population-based repulsive reward shaping mechanism using eligibility traces to enhance the efficiency in exploring the state-space under the scope of tabular reinforcement learning representation. The framework contains a hierarchical structure of RL agents, where a higher level repulsive-reward-shaper agent (RRS-Agent) coordinates the exploration of its population of sub-agents through repulsion when necessary conditions on their eligibility traces are met. Empirical results on well-known benchmark problem domains show that the framework indeed achieves efficient exploration with a significant improvement in learning performance and state-space coverage. Furthermore, the transparency of the proposed framework enables explainable decisions made by the agents in the hierarchical structure to explore the state-space in a coordinated manner and supports the interpretability of the framework.

Keywords: Reinforcement learning; Population-based exploration; Eligibility traces; Reward shaping; Coordinated agents (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s10479-023-05798-1 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:annopr:v:347:y:2025:i:2:d:10.1007_s10479-023-05798-1

Ordering information: This journal article can be ordered from
http://www.springer.com/journal/10479

DOI: 10.1007/s10479-023-05798-1

Access Statistics for this article

Annals of Operations Research is currently edited by Endre Boros

More articles in Annals of Operations Research from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-05-10
Handle: RePEc:spr:annopr:v:347:y:2025:i:2:d:10.1007_s10479-023-05798-1