Extraction of Water Bodies from High-Resolution Aerial and Satellite Images Using Visual Foundation Models

Samed Ozdemir, Zeynep Akbulut, Fevzi Karsli and Taskin Kavzoglu
Additional contact information
Samed Ozdemir: Department of Geomatics Engineering, Faculty of Engineering and Natural Sciences, Gumushane University, 29100 Gumushane, Turkey
Zeynep Akbulut: Department of Geomatics Engineering, Faculty of Engineering and Natural Sciences, Gumushane University, 29100 Gumushane, Turkey
Fevzi Karsli: Department of Geomatics Engineering, Faculty of Engineering, Karadeniz Technical University, 61080 Trabzon, Turkey
Taskin Kavzoglu: Department of Geomatics Engineering, Faculty of Engineering, Gebze Technical University, 41400 Kocaeli, Turkey

Sustainability, 2024, vol. 16, issue 7, 1-23

Abstract: Water is indispensable for life and central to ecosystems, human activities, and climate dynamics, so it requires rapid and accurate monitoring. Such monitoring is vital for sustaining ecosystems, enhancing human welfare, and effectively managing land, water, and biodiversity at both the local and global levels. In the rapidly evolving domain of remote sensing and deep learning, this study focuses on water body extraction and classification using a recent class of deep learning models, visual foundation models (VFMs). Specifically, the Segment Anything Model (SAM) and Contrastive Language-Image Pre-training (CLIP) models have shown promise in semantic segmentation, dataset creation, change detection, and instance segmentation tasks. A novel two-step approach is proposed: images are first segmented with the Automatic Mask Generator method of the SAM, and the resulting segments are then classified zero-shot using CLIP. Its effectiveness is tested on water body extraction problems. The proposed methodology was applied to both satellite imagery acquired from LANDSAT 8 OLI and very high-resolution aerial imagery. Results revealed that the proposed methodology accurately delineated water bodies across complex environmental conditions, achieving a mean intersection over union (IoU) of 94.41% and an F1 score of 96.97% for satellite imagery. Similarly, for the aerial imagery dataset, the proposed methodology achieved a mean IoU of 90.83% and an F1 score exceeding 94.56%. The high accuracy achieved in selecting segments predominantly classified as water highlights the effectiveness of the proposed model in intricate environmental image analysis.
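
As an illustration of how the reported metrics relate to the two-step approach, the following is a minimal sketch, not the authors' implementation. It assumes the segmentation step has already produced binary segment masks and that the classification step has assigned each segment a text label; the function names, the "water" and "land" labels, and the toy masks are all hypothetical. It merges the segments labelled as water into one mask and computes single-class IoU and F1 against a reference mask. How the paper averages IoU into a mean IoU is not stated in the abstract, so no averaging is shown.

```python
from typing import List, Sequence, Tuple

Mask = List[List[int]]  # binary mask: 1 = pixel belongs to the segment


def union_water_masks(masks: Sequence[Mask], labels: Sequence[str]) -> Mask:
    """Merge every segment labelled 'water' into one binary water mask."""
    rows, cols = len(masks[0]), len(masks[0][0])
    merged = [[0] * cols for _ in range(rows)]
    for mask, label in zip(masks, labels):
        if label != "water":  # hypothetical label name
            continue
        for r in range(rows):
            for c in range(cols):
                if mask[r][c]:
                    merged[r][c] = 1
    return merged


def iou_and_f1(pred: Mask, truth: Mask) -> Tuple[float, float]:
    """Return (IoU, F1) for the positive class of two same-sized binary masks."""
    tp = fp = fn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1
            elif p:
                fp += 1
            elif t:
                fn += 1
    union = tp + fp + fn
    # Both masks empty: treat as perfect agreement (a convention, not a rule).
    iou = tp / union if union else 1.0
    f1 = 2 * tp / (2 * tp + fp + fn) if union else 1.0
    return iou, f1


# Toy 2x3 example with three hypothetical segments and their labels.
segments = [
    [[1, 1, 0], [0, 0, 0]],
    [[0, 0, 0], [0, 1, 1]],
    [[0, 0, 1], [1, 0, 0]],
]
segment_labels = ["water", "land", "water"]
reference = [[1, 1, 0], [1, 0, 0]]

predicted = union_water_masks(segments, segment_labels)
iou, f1 = iou_and_f1(predicted, reference)
print(f"IoU = {iou:.4f}, F1 = {f1:.4f}")
```

For a single class the two metrics are tied by F1 = 2 IoU / (1 + IoU), which is why F1 is always at least as large as IoU for the same prediction.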

Keywords: visual foundation models; Segment Anything Model; CLIP; water bodies; semantic segmentation
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56
Date: 2024
Downloads: (external link)
https://www.mdpi.com/2071-1050/16/7/2995/pdf (application/pdf)
https://www.mdpi.com/2071-1050/16/7/2995/ (text/html)

Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:16:y:2024:i:7:p:2995-:d:1369794

Sustainability is currently edited by Ms. Alexandra Wu

Bibliographic data for series maintained by MDPI Indexing Manager.

 
Page updated 2025-03-19
Handle: RePEc:gam:jsusta:v:16:y:2024:i:7:p:2995-:d:1369794