Evaluation of Thermal Comfort in Urban Commercial Space with Vision–Language-Model-Based Agent Model
Dongyi Zhang, 
Zihao Xiong and 
Xun Zhu ()
Additional contact information 
Dongyi Zhang: Creative Computing Institute, University of the Arts London, London WC1V 7EY, UK
Zihao Xiong: School of Architecture & Design, University for the Creative Arts, Canterbury CT1 3AN, UK
Xun Zhu: School of Architecture and Design, Harbin Institute of Technology, Harbin 150006, China
Land, 2025, vol. 14, issue 4, 1-18
Abstract:
Thermal comfort in urban commercial spaces significantly impacts both business performance and public well-being. Traditional evaluation methods relying on field surveys and expert assessments are often time-consuming and labor-intensive. This study proposes a novel vision–language model (VLM)-based agent system for thermal comfort assessment in commercial spaces, simulating eight distinct heat-sensitive roles with varied demographic backgrounds through prompt engineering using ChatGPT-4o. Taking Harbin Central Street, China as a case study, we first validated model accuracy through ASHRAE scale evaluations of 30% samples (167 images) by 50 experts, and then conducted thermal comfort simulations of eight heat-sensitive roles followed by spatial and interpretability analyses. Key findings include (1) a significant correlation between VLM assessments and expert evaluations (r = 0.815, p < 0.001), confirming method feasibility; (2) notable heterogeneity in thermal comfort evaluations across eight agents, demonstrating the VLMs’ capacity to capture perceptual differences among social groups; (3) spatial analysis revealing higher thermal comfort in eastern regions compared to western and central areas despite inter-role variations, demonstrating consistency among agents; and (4) the shade and vegetation being identified as primary influencing factors that contribute to the agent’s decision making. This research validates VLM-based agents’ effectiveness in urban thermal comfort evaluation, showcasing their dual capability in replicating traditional methods while capturing social group differences. The proposed approach establishes a novel paradigm for efficient, comprehensive, and multi-perspective thermal comfort assessments in urban commercial environments.
Keywords: vision–language models (VLMs); thermal comfort; commercial space; urban planning; agent-based model (search for similar items in EconPapers)
JEL-codes: Q15 Q2 Q24 Q28 Q5 R14 R52  (search for similar items in EconPapers)
Date: 2025
References: View references in EconPapers View complete reference list from CitEc 
Citations: 
Downloads: (external link)
https://www.mdpi.com/2073-445X/14/4/786/pdf (application/pdf)
https://www.mdpi.com/2073-445X/14/4/786/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX 
RIS (EndNote, ProCite, RefMan) 
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jlands:v:14:y:2025:i:4:p:786-:d:1629038
Access Statistics for this article
Land is currently edited by Ms. Carol Ma
More articles in Land  from  MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().