Optimal task replication considering reliability, performance, and energy consumption for parallel computing in cloud systems

Xiwei Qiu, Peng Sun and Yuanshun Dai

Reliability Engineering and System Safety, 2021, vol. 215, issue C

Abstract: In a cloud-based cyber–physical system, many jobs consist of multiple parallel tasks. The cloud system usually adopts active task replication to improve performance and guarantee the reliability of a job. This technology creates redundant replicas for each task and then executes the replicas concurrently. In the cloud system, each replica is a virtual machine (VM) image that can be easily assigned to different physical machines (PMs) to overcome resource heterogeneity. However, how to design a rational task replication strategy (including replica creation and VM assignment) is indeed a complex issue. It should comprehensively consider correlations and tradeoffs among reliability, performance, and energy consumption. This paper first proposes a reliability–performance correlation model for a job executed by using active task replication. We design a general method to avoid analyzing complex failure correlations and give a Bayesian approach to calculate the performability metric of the job. The paper also proposes a reliability–energy correlation model to evaluate random energy consumption of a PM hosting multiple VMs by using mixed random variables. Finally, an expected net profit optimization model and a genetic algorithm are developed to search for an optimal task replication strategy balancing tradeoffs among reliability, performance, and energy consumption. Illustrative examples are provided.
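
The tradeoff the abstract describes can be illustrated with a deliberately simplified sketch. It is not the paper's model: it assumes replica failures are independent (the paper specifically addresses correlated failures), ignores performance and VM-to-PM assignment, charges a flat hypothetical energy cost per replica, and uses exhaustive search instead of a genetic algorithm. All function names and parameters (`reward`, `cost_per_replica`, `max_replicas`) are assumptions made for illustration.

```python
from itertools import product


def task_reliability(p, replicas):
    """Probability that at least one of `replicas` independent copies succeeds."""
    return 1.0 - (1.0 - p) ** replicas


def job_reliability(p_list, replica_counts):
    """A job of parallel tasks succeeds only if every task succeeds."""
    reliability = 1.0
    for p, k in zip(p_list, replica_counts):
        reliability *= task_reliability(p, k)
    return reliability


def expected_net_profit(p_list, replica_counts, reward, cost_per_replica):
    """Expected reward for a completed job minus a flat cost per replica (assumed)."""
    return (reward * job_reliability(p_list, replica_counts)
            - cost_per_replica * sum(replica_counts))


def best_replication(p_list, reward, cost_per_replica, max_replicas=4):
    """Exhaustively search replica counts per task (stand-in for the genetic algorithm)."""
    candidates = product(range(1, max_replicas + 1), repeat=len(p_list))
    return max(
        candidates,
        key=lambda ks: expected_net_profit(p_list, ks, reward, cost_per_replica),
    )


if __name__ == "__main__":
    # Two parallel tasks, each replica succeeding with probability 0.9.
    print(best_replication([0.9, 0.9], reward=100.0, cost_per_replica=1.0))
```

Even in this toy setting, adding replicas raises reliability with diminishing returns while cost grows linearly, so the best replica count is finite. Exhaustive search is only workable for small instances, which is one reason a heuristic such as a genetic algorithm is attractive for realistic problem sizes.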

Keywords: Active task replication; Correlation modeling; Parallel and redundant computing; Cloud computing
Date: 2021

Full text (ScienceDirect subscribers only): http://www.sciencedirect.com/science/article/pii/S0951832021003549

Persistent link: https://EconPapers.repec.org/RePEc:eee:reensy:v:215:y:2021:i:c:s0951832021003549

DOI: 10.1016/j.ress.2021.107834

Reliability Engineering and System Safety is currently edited by Carlos Guedes Soares

Handle: RePEc:eee:reensy:v:215:y:2021:i:c:s0951832021003549