EconPapers    
Economics at your fingertips  
 

GoCJ: Google Cloud Jobs Dataset for Distributed and Cloud Computing Infrastructures

Altaf Hussain and Muhammad Aleem
Additional contact information
Altaf Hussain: Department of Computer Science, Faculty of Computing, Capital University of Science and Technology, Islamabad 44000, Pakistan
Muhammad Aleem: Department of Computer Science, Faculty of Computing, Capital University of Science and Technology, Islamabad 44000, Pakistan

Data, 2018, vol. 3, issue 4, 1-12

Abstract: Developers of resource-allocation and scheduling algorithms share test datasets (i.e., benchmarks) to enable others to compare the performance of newly developed algorithms. However, mostly it is hard to acquire real cloud datasets due to the users’ data confidentiality issues and policies maintained by Cloud Service Providers (CSP). Accessibility of large-scale test datasets, depicting the realistic high-performance computing requirements of cloud users, is very limited. Therefore, the publicly available real cloud dataset will significantly encourage other researchers to compare and benchmark their applications using an open-source benchmark. To meet these objectives, the contemporary state of the art has been scrutinized to explore a real workload behavior in Google cluster traces. Starting from smaller- to moderate-size cloud computing infrastructures, the dataset generation process is demonstrated using the Monte Carlo simulation method to produce a Google Cloud Jobs (GoCJ) dataset based on the analysis of Google cluster traces. With this article, the dataset is made publicly available to enable other researchers in the field to investigate and benchmark their scheduling and resource-allocation schemes for the cloud. The GoCJ dataset is archived and available on the Mendeley Data repository.

Keywords: GoCJ dataset; meta-task dataset; HPC dataset; scientific dataset (search for similar items in EconPapers)
JEL-codes: C8 C80 C81 C82 C83 (search for similar items in EconPapers)
Date: 2018
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)

Downloads: (external link)
https://www.mdpi.com/2306-5729/3/4/38/pdf (application/pdf)
https://www.mdpi.com/2306-5729/3/4/38/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jdataj:v:3:y:2018:i:4:p:38-:d:172608

Access Statistics for this article

Data is currently edited by Ms. Cecilia Yang

More articles in Data from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jdataj:v:3:y:2018:i:4:p:38-:d:172608