N-player Pareto game for the discrete-time Markov jump systems with multiplicative noises using off-policy reinforcement learning

Bo, Chunling; Lin, Yaning; Li, Yuxia

N-player Pareto game for the discrete-time Markov jump systems with multiplicative noises using off-policy reinforcement learning

Chunling Bo, Yaning Lin and Yuxia Li

Mathematics and Computers in Simulation (MATCOM), 2026, vol. 245, issue C, 464-482

Abstract: This paper investigates a discrete-time N-player Pareto cooperative game problem for the Markov jump systems with multiplicative noises. First, utilizing the convexity condition, the Pareto cooperative game problem is transformed into a weighted sum optimal control problem. To address this problem, we reconstruct the system and prove that the optimal control gains do not change after the reconstruction. Compared with the traditional policy iteration algorithm which is sensitive to probing noise, the proposed model-based off-policy algorithm yields unbiased solution to Pareto cooperative game problem when probing noise is added. Due to the complexity of the jump systems and the multiplicative noises, relying on its dynamic information may cause policy bias in some situations. To further eliminate the dependence on the system information of model-based off-policy algorithm, a model-free off-policy algorithm is designed based on the properties of the trace using only information of the system states and inputs. In addition, the convergence of the proposed algorithm and the stability of the system under each iteration step are proved. Finally, practical numerical examples are given to show the effectiveness of the algorithm.

Keywords: Discrete-time stochastic systems; Markov jumps; Multi-player Pareto game; Off-policy reinforcement learning (search for similar items in EconPapers)
Date: 2026
References: Add references at CitEc
Citations:

Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0378475426000662
Full text for ScienceDirect subscribers only

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:eee:matcom:v:245:y:2026:i:c:p:464-482

DOI: 10.1016/j.matcom.2026.02.020

Access Statistics for this article

Mathematics and Computers in Simulation (MATCOM) is currently edited by Robert Beauwens

More articles in Mathematics and Computers in Simulation (MATCOM) from Elsevier
Bibliographic data for series maintained by Catherine Liu ().