Semantic image editing for reshaping architecture of power: Lesson learned from selected cases

Filipiak, Dominik; Sobiczewska, Julita

Semantic image editing for reshaping architecture of power: Lesson learned from selected cases

Dominik Filipiak () and Julita Sobiczewska ()
Additional contact information
Dominik Filipiak: Adam Mickiewicz University, Poznan, PolandPerelyn, Warsaw, Poland

Smart Cities and Regional Development (SCRD) Journal, 2025, vol. 9, issue 4, 7-33

Abstract: Objectives We explore an evaluation scheme for assessment of generative computer vision models in architecture-related tasks with a focus on text-conditioned image editing for use cases relating to architecture of power. It is an umbrella term for building ranging from Socialist Realism to Post-War Modernism. While some of them can be considered landmarks on former Eastern Bloc countries, they often lack modern features, such as accessibility. With a recent progress in generative vision, the diffusion pipelines can be used to reimagine such buildings with pictures, which may later provide a blueprint for transforming such sites. Prior work While an intense effort can be observed in image generation models (including semantic image editing) and their applications (such as architecture), evaluating domain-specific benchmarks is still cumbersome. The case of architecture of power carries unique challenges, as it is a domain rather underrepresented in the publicly available datasets on which many models are pretrained. Results We present selected results of our evaluation schema for assessing generative vision models for various tasks related to improving mid-20th century architecture, which consist of taxonomy of tasks. We also demonstrate the proposed approach on a several state-of-the-art text- and image-conditioned diffusion models and pipelines (such as DiffEdit, Kandinsky, or ControlNet) for selected buildings in Warsaw, Cracow, Riga, and Bucharest. Implications While the presented evaluation scheme is rather intended to be used by researchers, the results of such an assessment can be used to select models most suitable for the architecture and urban planning communities. Since we focus on text-conditioned models, they can be used by general audience to help reimagining the buildings according to their need.

Keywords: semantic image editing; architecture of power; sustainability; evaluation; benchmark (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
https://scrd.eu/index.php/scrd/article/view/728/760 (application/pdf)
https://scrd.eu/index.php/scrd/article/view/728 (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:pop:journl:v:9:y:2025:i:4:p:7-33

DOI: 10.25019/4w9sh824

Access Statistics for this article

More articles in Smart Cities and Regional Development (SCRD) Journal from Smart-EDU Hub, Faculty of Public Administration, National University of Political Studies & Public Administration Contact information at EDIRC.
Bibliographic data for series maintained by Professor Catalin Vrabie ().