EconPapers    
Economics at your fingertips  
 

Understanding Housing Prices Using Geographic Big Data: A Case Study in Shenzhen

Xufeng Jiang, Zelu Jia, Lefei Li and Tianhong Zhao
Additional contact information
Xufeng Jiang: Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China
Zelu Jia: Government Services Data Bureau of Bao’an District Shenzhen Municipality, Shenzhen 518000, China
Lefei Li: School of Architecture and Urban Planning, Shenzhen University, Shenzhen 518000, China
Tianhong Zhao: School of Architecture and Urban Planning, Shenzhen University, Shenzhen 518000, China

Sustainability, 2022, vol. 14, issue 9, 1-20

Abstract: Understanding the spatial pattern of urban house prices and its association with the built environment is of great significance to housing policymaking and urban planning. However, many studies on the influencing factors of urban housing prices conduct qualitative analyses using statistical data and manual survey data. In addition, traditional housing price models are mostly linear models that cannot explain the distribution of housing prices in urban areas. In this paper, we propose using geographic big data and zonal nonlinear feature machine learning models to understand housing prices. First, the housing price influencing factor system is built based on the hedonic pricing model and geographic big data, and it includes commercial development, transportation, infrastructure, location, education, environment, and residents’ consumption level. Second, a spatial exploratory analysis framework for house price data was constructed using Moran’s I tools and geographic detectors. Finally, the XGBoost model is developed to assess the importance of the variables influencing housing prices, and the zonal nonlinear feature model is built to predict housing prices based on spatial exploration results. Taking Shenzhen as an example, this paper explored the distribution law of housing prices, analyzed the influencing factors of housing prices, and compared the different housing price models. The results show that the zonal nonlinear feature model has higher accuracy than the linear model and the global model.

Keywords: housing price model; machine learning; big data; regression analysis; built environment (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)

Downloads: (external link)
https://www.mdpi.com/2071-1050/14/9/5307/pdf (application/pdf)
https://www.mdpi.com/2071-1050/14/9/5307/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:14:y:2022:i:9:p:5307-:d:804159

Access Statistics for this article

Sustainability is currently edited by Ms. Alexandra Wu

More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jsusta:v:14:y:2022:i:9:p:5307-:d:804159