EconPapers    
Economics at your fingertips  
 

PERFORMANCE ANALYSIS OF TWO BIG DATA TECHNOLOGIES ON A CLOUD DISTRIBUTED ARCHITECTURE. RESULTS FOR NON-AGGREGATE QUERIES ON MEDIUM-SIZED DATA

Marin Fotache and IonuÈ› Hrubaru

Scientific Annals of Economics and Business (continues Analele Stiintifice), 2017, vol. 63, issue si, 21 - 50

Abstract: Big Data systems manage and process huge volumes of data constantly generated by various technologies in a myriad of formats. Big Data advocates (and preachers) have claimed that, relative to classical, relational/SQL Data Base Management Systems, Big Data technologies such as NoSQL, Hadoop and in-memory data stores perform better. This paper compares data processing performance of two systems belonging to SQL (PostgreSQL/Postgres XL) and Big Data (Hadoop/Hive) camps on a distributed five-node cluster deployed in cloud. Unlike benchmarks in use (YCSB, TPC), a series of R modules were devised for generating random non-aggregate queries on different subschema (with increasing data size) of TPC-H database. Overall performance of the two systems was compared. Subsequently a number of models were developed for relating performance on the system and also on various query parameters such as the number of attributes in SELECT and WHERE clause, number of joins, number of processing rows etc. JEL Codes - M15

Keywords: Big Data; cloud computing; performance benchmarks; Hadoop; Hive; PostgreSQL; Postgres XL; R (search for similar items in EconPapers)
Date: 2017
References: Add references at CitEc
Citations:

Downloads: (external link)
http://saeb.feaa.uaic.ro/index.php/saeb/article/view/1035 (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:aic:saebjn:v:63:y:2017:i:si:p:21-50:n:62

DOI: 10.1515/saeb-2016-0134

Access Statistics for this article

More articles in Scientific Annals of Economics and Business (continues Analele Stiintifice) from Alexandru Ioan Cuza University, Faculty of Economics and Business Administration Contact information at EDIRC.
Bibliographic data for series maintained by Sireteanu Napoleon-Alexandru ().

 
Page updated 2025-03-19
Handle: RePEc:aic:saebjn:v:63:y:2017:i:si:p:21-50:n:62