Deep Reinforcement Learning-Based Scheduler on Parallel Dedicated Machine Scheduling Problem towards Minimizing Total Tardiness

Lee, Donghun; Kang, Hyeongwon; Lee, Dongjin; Lee, Jeonwoo; Kim, Kwanho

Deep Reinforcement Learning-Based Scheduler on Parallel Dedicated Machine Scheduling Problem towards Minimizing Total Tardiness

Donghun Lee, Hyeongwon Kang, Dongjin Lee, Jeonwoo Lee and Kwanho Kim ()
Additional contact information
Donghun Lee: Department of Industrial and Management Engineering, Incheon National University, Incheon 22012, Republic of Korea
Hyeongwon Kang: Department of Industrial and Management Engineering, Incheon National University, Incheon 22012, Republic of Korea
Dongjin Lee: Department of Industrial and Management Engineering, Incheon National University, Incheon 22012, Republic of Korea
Jeonwoo Lee: Department of Industrial and Management Engineering, Incheon National University, Incheon 22012, Republic of Korea
Kwanho Kim: Department of Industrial and Management Engineering, Incheon National University, Incheon 22012, Republic of Korea

Sustainability, 2023, vol. 15, issue 4, 1-14

Abstract: This study considers a parallel dedicated machine scheduling problem towards minimizing the total tardiness of allocated jobs on machines. In addition, this problem comes under the category of NP-hard. Unlike classical parallel machine scheduling, a job is processed by only one of the dedicated machines according to its job type defined in advance, and a machine is able to process at most one job at a time. To obtain a high-quality schedule in terms of total tardiness for the considered scheduling problem, we suggest a machine scheduler based on double deep Q-learning. In the training phase, the considered scheduling problem is redesigned to fit into the reinforcement learning framework and suggest the concepts of state, action, and reward to understand the occurrences of setup, tardiness, and the statuses of allocated job types. The proposed scheduler, repeatedly finds better Q-values towards minimizing tardiness of allocated jobs by updating the weights in a neural network. Then, the scheduling performances of the proposed scheduler are evaluated by comparing it with the conventional ones. The results show that the proposed scheduler outperforms the conventional ones. In particular, for two datasets presenting extra-large scheduling problems, our model performs better compared to existing genetic algorithm by 12.32% and 29.69%.

Keywords: machine scheduling; deep reinforcement learning; parallel dedicated machines; sustainable manufacturing; total tardiness objective (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://www.mdpi.com/2071-1050/15/4/2920/pdf (application/pdf)
https://www.mdpi.com/2071-1050/15/4/2920/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:15:y:2023:i:4:p:2920-:d:1059335

Access Statistics for this article

Sustainability is currently edited by Ms. Alexandra Wu

More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().