A Deep Learning Semantic Segmentation Method for Landslide Scene Based on Transformer Architecture

Wang, Zhaoqiu; Sun, Tao; Hu, Kun; Zhang, Yueting; Yu, Xiaqiong; Li, Ying

A Deep Learning Semantic Segmentation Method for Landslide Scene Based on Transformer Architecture

Zhaoqiu Wang, Tao Sun (), Kun Hu, Yueting Zhang (), Xiaqiong Yu and Ying Li
Additional contact information
Zhaoqiu Wang: Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China
Tao Sun: Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China
Kun Hu: Institute of Artificial Intelligence, Beihang University, Beijing 100191, China
Yueting Zhang: Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China
Xiaqiong Yu: Satellite Application Center, Beijing 100094, China
Ying Li: Airlook Aviation Technology (Beijing) Co., Ltd., Beijing 100070, China

Sustainability, 2022, vol. 14, issue 23, 1-22

Abstract: Semantic segmentation technology based on deep learning has developed rapidly. It is widely used in remote sensing image recognition, but is rarely used in natural disaster scenes, especially in landslide disasters. After a landslide disaster occurs, it is necessary to quickly carry out rescue and ecological restoration work, using satellite data or aerial photography data to quickly analyze the landslide area. However, the precise location and area estimation of the landslide area is still a difficult problem. Therefore, we propose a deep learning semantic segmentation method based on Encoder-Decoder architecture for landslide recognition, called the Separable Channel Attention Network (SCANet). The SCANet consists of a Poolformer encoder and a Separable Channel Attention Feature Pyramid Network (SCA-FPN) decoder. Firstly, the Poolformer can extract global semantic information at different levels with the help of transformer architecture, and it greatly reduces computational complexity of the network by using pooling operations instead of a self-attention mechanism. Secondly, the SCA-FPN we designed can fuse multi-scale semantic information and complete pixel-level prediction of remote sensing images. Without bells and whistles, our proposed SCANet outperformed the mainstream semantic segmentation networks with fewer model parameters on our self-built landslide dataset. The mIoU scores of SCANet are 1.95% higher than ResNet50-Unet, especially.

Keywords: landslide; remote sensing images; semantic segmentation; Poolformer; separable channel attention (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2022
References: View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://www.mdpi.com/2071-1050/14/23/16311/pdf (application/pdf)
https://www.mdpi.com/2071-1050/14/23/16311/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:14:y:2022:i:23:p:16311-:d:995466

Access Statistics for this article

Sustainability is currently edited by Ms. Alexandra Wu

More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().