Green Data Analytics of Supercomputing from Massive Sensor Networks: Does Workload Distribution Matter?

Guo, Zhiling; Li, Jin; Ramesh, Ram

Green Data Analytics of Supercomputing from Massive Sensor Networks: Does Workload Distribution Matter?

Zhiling Guo (), Jin Li () and Ram Ramesh ()
Additional contact information
Zhiling Guo: School of Computing and Information Systems, Singapore Management University, Singapore 178902
Jin Li: School of Management, Xi’an Jiaotong University, Xi’an 710049, China
Ram Ramesh: Department of Management Science and Systems, State University of New York, Buffalo, New York 14260

Information Systems Research, 2023, vol. 34, issue 4, 1664-1685

Abstract: Energy costs represent a significant share of the total cost of ownership in high-performance computing (HPC) systems. Using a unique data set collected by massive sensor networks in a petascale national supercomputing center, we first present an explanatory model to identify key factors that affect energy consumption in supercomputing. Our analytic results show that, not only does computing node utilization significantly affect energy consumption, workload distribution among the nodes also has significant effects and could effectively be leveraged to improve energy efficiency. Next, we establish the high model performance using in-sample and out-of-sample analyses. We then develop prescriptive models for energy-optimal runtime workload management and extend the models to consider energy consumption and job performance tradeoffs. Specifically, we present four dynamic resource management methodologies ( packing , load balancing , threshold-based switching, and energy optimization ), model their application at two levels (purely within-rack and jointly cross-rack resource allocation), and explore runtime resource redistribution policies for jobs under the emergent principle of computational steering and comparatively evaluate strategies that use computational steering with those that do not. Our experimental studies show that packing is preferred when the total workload of a rack is higher than a threshold and load balancing is preferred when it is lower. These results lead to a threshold strategy that yields near-optimal energy efficiency under all workload conditions. We further calibrate the energy-optimal resource allocations over the full range of workloads and present a bicriteria evaluation to consider energy consumption and job performance tradeoffs. We demonstrate significant energy savings of our proposed strategies under various workload conditions. We conclude with implementation guidelines and policy insights into energy-efficient computing resource management in large supercomputing data centers.

Keywords: high-performance computing; data center; energy-efficient operation; data analytics; autoregressive model; dynamic panel data; optimization (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
http://dx.doi.org/10.1287/isre.2023.1208 (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:inm:orisre:v:34:y:2023:i:4:p:1664-1685

Access Statistics for this article

More articles in Information Systems Research from INFORMS Contact information at EDIRC.
Bibliographic data for series maintained by Chris Asher ().