Optimizing Financial Data Analysis: A Comparative Study of Preprocessing Techniques for Regression Modeling of Apple Inc.’S Net Income and Stock Prices
Ungar Kevin () and
Camelia Oprean-Stan
Additional contact information
Ungar Kevin: Lucian Blaga University of Sibiu, Faculty of Economic Sciences, Sibiu, Romania
Studia Universitatis „Vasile Goldis” Arad – Economics Series, 2025, vol. 35, issue 1, 49-82
Abstract:
This article presents a comprehensive methodology for processing financial datasets of Apple Inc., encompassing quarterly income and daily stock prices, spanning from March 31, 2009, to December 31, 2023. Leveraging 60 observations for quarterly income and 3774 observations for daily stock prices, sourced from Macrotrends and Yahoo Finance respectively, the study outlines five distinct datasets crafted through varied preprocessing techniques. Through detailed explanations of aggregation, interpolation (linear, polynomial, and cubic spline) and lagged variables methods, the study elucidates the steps taken to transform raw data into analytically rich datasets. Subsequently, the article delves into regression analysis, aiming to decipher which of the five data processing methods best suits capital market analysis, by employing both linear and polynomial regression models on each preprocessed dataset and evaluating their performance using a range of metrics, including cross-validation score, MSE, MAE, RMSE, R-squared, and Adjusted R-squared. The research findings reveal that linear interpolation with polynomial regression emerges as the top-performing method, boasting the lowest validation MSE and MAE values, alongside the highest R-squared and Adjusted R-squared values.
Keywords: linear regression analysis; polynomial regression; stock prices; financial data processing; Python programming (search for similar items in EconPapers)
JEL-codes: C45 G14 G21 G32 (search for similar items in EconPapers)
Date: 2025
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://doi.org/10.2478/sues-2025-0004 (text/html)
Related works:
Working Paper: Optimizing Financial Data Analysis: A Comparative Study of Preprocessing Techniques for Regression Modeling of Apple Inc.'s Net Income and Stock Prices (2025) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:vrs:suvges:v:35:y:2025:i:1:p:49-82:n:1004
DOI: 10.2478/sues-2025-0004
Access Statistics for this article
Studia Universitatis „Vasile Goldis” Arad – Economics Series is currently edited by Florin Cornel Dumiter
More articles in Studia Universitatis „Vasile Goldis” Arad – Economics Series from Sciendo
Bibliographic data for series maintained by Peter Golla ().