A Hybrid Machine Learning Model Architecture with Clustering Analysis and Stacking Ensemble for Real Estate Price Prediction
Cihan Çılgın () and
Hadi Gökçen ()
Additional contact information
Cihan Çılgın: Bolu Abant İzzet Baysal University
Hadi Gökçen: Gazi University
Computational Economics, 2025, vol. 66, issue 1, No 5, 127-178
Abstract:
Abstract Population growth, rapid developments in technology, increase in living standards, changes in the household structure and economic structure of societies, and the increase in urbanization at very high rates, as well as the increase in the demand for renting or purchasing real estate, have both expanded the real estate market and made it more active. This intense activity in the real estate markets also accelerates real estate price prediction studies in direct proportion. The aim of this study is to present a model architecture that can achieve high accuracy in predicting the current market value of real estates by using a hybrid approach, through clustering models as a preliminary approach, in order to achieve higher homogeneity with stacking ensemble using multiple machine learning methods. In order to obtain more homogeneous submarkets, the collected data set was first grouped according to the number of rooms and then each group was divided into clusters by cluster analysis. In this way, more homogeneous submarkets were obtained and predict accuracy was improved. Then, the training process was carried out for 13 different weak learners using fivefold cross-validation for each determined sub-market. Feature selection and parameter optimization were performed separately for each weak learner. Then, the predictions obtained according to the feature and parameter set that gave the best results were used to train the meta-learner. As a result of this entire process, the final prediction was created with the meta learner that gave the least error rate. As the findings show, high predicting performance at international standards has been demonstrated even in a period of high price fluctuations for many and various sub-markets of real estate.
Keywords: Real estate price prediction; Machine learning; Stacking ensemble; Hybrid; Clustering analysis (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s10614-024-10703-4 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:kap:compec:v:66:y:2025:i:1:d:10.1007_s10614-024-10703-4
Ordering information: This journal article can be ordered from
http://www.springer. ... ry/journal/10614/PS2
DOI: 10.1007/s10614-024-10703-4
Access Statistics for this article
Computational Economics is currently edited by Hans Amman
More articles in Computational Economics from Springer, Society for Computational Economics Contact information at EDIRC.
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().