To Pool or Not to Pool: Queueing Design for Large-Scale Service Systems
Ping Cao (),
Shuangchi He (),
Junfei Huang () and
Yunan Liu ()
Additional contact information
Ping Cao: School of Management, University of Science and Technology of China, 230026 Hefei, China
Shuangchi He: Department of Industrial Systems Engineering and Management, National University of Singapore, Singapore 117576
Junfei Huang: Department of Decision Sciences and Managerial Economics, CUHK Business School, Chinese University of Hong Kong, Shatin, Hong Kong
Yunan Liu: Department of Industrial and Systems Engineering, North Carolina State University, Raleigh, North Carolina 27695
Operations Research, 2021, vol. 69, issue 6, 1866-1885
Abstract:
There are two basic queue structures commonly adopted in service systems: the pooled structure, where waiting customers are organized into a single queue served by a group of servers, and the dedicated structure, where each server has her own queue. Although the pooled structure, known to minimize the servers’ idle time, is widely used in large-scale service systems, this study reveals that the dedicated structure, along with the join-the-shortest-queue routing policy, could be more advantageous for improving certain performance measures, such as the probability of a customer’s waiting time being within a delay target. The servers’ additional idleness resulting from the dedicated structure will be negligible when the system scale is large. Using a fluid model substantiated by asymptotic analysis, we provide a performance comparison between the two structures for a moderately overloaded queueing system with customer abandonment. We intend to help service system designers answer the following question: To reach a specified service-level target, which queue structure will be more cost effective? Aside from structure design, our results are of practical value for performance analysis and staffing deployment.
Keywords: dedicated queue; join-the-shortest-queue; power-of- d; join-idle-queues; customer abandonment; overloaded queue; many-server heavy-traffic limit (search for similar items in EconPapers)
Date: 2021
References: Add references at CitEc
Citations:
Downloads: (external link)
http://dx.doi.org/10.1287/opre.2019.1976 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:inm:oropre:v:69:y:2021:i:6:p:1866-1885
Access Statistics for this article
More articles in Operations Research from INFORMS Contact information at EDIRC.
Bibliographic data for series maintained by Chris Asher ().