Online Self-Organizing Network Control with Time Averaged Weighted Throughput Objective

Zhang, Zhicong; Li, Shuai; Yan, Xiaohui

Online Self-Organizing Network Control with Time Averaged Weighted Throughput Objective

Zhicong Zhang, Shuai Li and Xiaohui Yan

Discrete Dynamics in Nature and Society, 2018, vol. 2018, 1-11

Abstract:

We study an online multisource multisink queueing network control problem characterized with self-organizing network structure and self-organizing job routing. We decompose the self-organizing queueing network control problem into a series of interrelated Markov Decision Processes and construct a control decision model for them based on the coupled reinforcement learning (RL) architecture. To maximize the mean time averaged weighted throughput of the jobs through the network, we propose a reinforcement learning algorithm with time averaged reward to deal with the control decision model and obtain a control policy integrating the jobs routing selection strategy and the jobs sequencing strategy. Computational experiments verify the learning ability and the effectiveness of the proposed reinforcement learning algorithm applied in the investigated self-organizing network control problem.

Date: 2018
References: Add references at CitEc
Citations:

Downloads: (external link)
http://downloads.hindawi.com/journals/DDNS/2018/4184805.pdf (application/pdf)
http://downloads.hindawi.com/journals/DDNS/2018/4184805.xml (text/xml)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:hin:jnddns:4184805

DOI: 10.1155/2018/4184805

Access Statistics for this article

More articles in Discrete Dynamics in Nature and Society from Hindawi
Bibliographic data for series maintained by Mohamed Abdelhakeem ().