Recalculate Without Recomputing
José Dias Curto
Computational Economics, 2024, vol. 64, issue 6, No 5, 3279-3294
Abstract:
Abstract The available memory on a desktop or laptop computer can easily be exceeded by a large data set/stream when performing analytics or training machine learning algorithms. In this paper we propose simple methods to recalculate and update the value of descriptive statistics (averages, variance, coefficients of skewness and kurtosis) after rescaling, adding or excluding some observations. The coefficient of determination from regression analysis is also recalculated after deleting each sample observation. The objective is to avoid including all the observations (especially in a big data set/stream) when those statistics are recalculated. For this, only its previous value updated by the changed observations is needed, saving time in the respective calculation . Applications include GDP, stock prices of the ‘Magnificent Seven’ and Price-to-Earnings ratio.
Keywords: Descriptive statistics; Coefficient of determination; Rescaling; GDP; Stock split; Price-to-Earnings ratio (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s10614-024-10558-9 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:kap:compec:v:64:y:2024:i:6:d:10.1007_s10614-024-10558-9
Ordering information: This journal article can be ordered from
http://www.springer. ... ry/journal/10614/PS2
DOI: 10.1007/s10614-024-10558-9
Access Statistics for this article
Computational Economics is currently edited by Hans Amman
More articles in Computational Economics from Springer, Society for Computational Economics Contact information at EDIRC.
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().