A Distributed Framework for Predictive Analytics Using Big Data and MapReduce Parallel Programming
P. Natesan,
V. E. Sathishkumar,
Sandeep Kumar Mathivanan,
Maheshwari Venkatasen,
Prabhu Jayagopal,
Shaikh Muhammad Allayear and
Adiel T. de Almeida-Filho
Mathematical Problems in Engineering, 2023, vol. 2023, 1-10
Abstract:
With the advancement of Internet technologies and the rapid increase of World Wide Web applications, there has been tremendous growth in the volume of digital data. This takes the digital world into a new era of big data. Various existing data processing technologies are not consistent and scalable in handling the complexity as well as the large-size datasets. Recently, there are many distributed data processing, and programming models have been proposed and implemented to handle big data applications. The open-source-implemented MapReduce programming model in Apache Hadoop is the foremost model for data exhaustive and also computational-intensive applications due to its inherent characteristics of scalability, fault tolerance, and simplicity. In this research article, a new approach for the prediction of target labels in big data applications is developed using a multiple linear regression algorithm and MapReduce programming model, named as MR-MLR. This approach promises optimum values for MAE, RMSE, and determination coefficient (R2) and thus shows its effectiveness in predictions in big data applications.
Date: 2023
References: Add references at CitEc
Citations:
Downloads: (external link)
http://downloads.hindawi.com/journals/mpe/2023/6048891.pdf (application/pdf)
http://downloads.hindawi.com/journals/mpe/2023/6048891.xml (application/xml)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:hin:jnlmpe:6048891
DOI: 10.1155/2023/6048891
Access Statistics for this article
More articles in Mathematical Problems in Engineering from Hindawi
Bibliographic data for series maintained by Mohamed Abdelhakeem ().