Information value of property description: A Machine learning approach

Lily Shen and Stephen Ross

Journal of Urban Economics, 2021, vol. 121, issue C

Abstract: This paper employs machine learning to quantify the value of “soft” information contained in real estate property descriptions. Textual descriptions contain information that traditional hedonic attributes cannot capture. A one standard deviation increase in the uniqueness of a property based on this “soft” information leads to a 15% increase in property sale price in a hedonic price model and a 10% increase in a repeat sales price model. The effects in the hedonic model appear to arise through two channels: the unobserved quality of the housing unit, and the market power of the housing unit relative to competing properties. The effects in the repeat sales model appear to be driven entirely by the market power of the unit. Further, an annual hedonic price index that ignores our measure of unobserved quality overstates real estate prices by between 10% and 23% and mistimes the stabilization of housing prices following the Great Recession. Similar but smaller effects are observed for the repeat sales price index.

Keywords: Natural language processing; Unsupervised machine learning; Soft information; Housing prices; Price indexes; Property descriptions
JEL-codes: R31 G12 G14 C45
Date: 2021
DOI: 10.1016/j.jue.2020.103299

Journal of Urban Economics is currently edited by S.S. Rosenthal and W.C. Strange

Page updated 2021-03-09
Handle: RePEc:eee:juecon:v:121:y:2021:i:c:s009411902030070x