Information value of property description: A Machine learning approach
Lily Shen and
Journal of Urban Economics, 2021, vol. 121, issue C
This paper employs machine learning to quantify the value of “soft” information contained in real estate property descriptions. Textual descriptions contain information that traditional hedonic attributes cannot capture. A one standard deviation increase in the uniqueness of a property based on this “soft” information leads to a 15% increase in property sale price in a hedonic price model and a 10% increase in a repeat sales price model. The effects in the hedonic model appear to arise through two channels: the unobserved quality of the housing unit, and the market power of the housing unit relative to competing properties. The effects in the repeat sales model appear to be driven entirely by the market power of the unit. Further, an annual hedonic price index ignoring our measure of unobserved quality overstates real estate prices by between 10% to 23% and mistimes the stabilization of housing prices following the Great Recession. Similar, but smaller effects, are observed for the repeat sales price index.
Keywords: Natural language processing; Unsupervised machine learning; Soft information; Housing prices; Price indexes; Property descriptions (search for similar items in EconPapers)
JEL-codes: R31 G12 G14 C45 (search for similar items in EconPapers)
References: View references in EconPapers View complete reference list from CitEc
Citations: Track citations by RSS feed
Downloads: (external link)
Full text for ScienceDirect subscribers only
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
Persistent link: https://EconPapers.repec.org/RePEc:eee:juecon:v:121:y:2021:i:c:s009411902030070x
Access Statistics for this article
Journal of Urban Economics is currently edited by S.S. Rosenthal and W.C. Strange
More articles in Journal of Urban Economics from Elsevier
Bibliographic data for series maintained by Nithya Sathishkumar ().