Cloudformer V2: Set Prior Prediction and Binary Mask Weighted Network for Cloud Detection

Zhang, Zheng; Xu, Zhiwei; Liu, Chang’an; Tian, Qing; Zhou, Yongsheng

Cloudformer V2: Set Prior Prediction and Binary Mask Weighted Network for Cloud Detection

Zheng Zhang, Zhiwei Xu, Chang’an Liu, Qing Tian and Yongsheng Zhou
Additional contact information
Zheng Zhang: School of Information Science and Technology, North China University of Technology, Beijing 100144, China
Zhiwei Xu: School of Information Science and Technology, North China University of Technology, Beijing 100144, China
Chang’an Liu: School of Information Science and Technology, North China University of Technology, Beijing 100144, China
Qing Tian: School of Information Science and Technology, North China University of Technology, Beijing 100144, China
Yongsheng Zhou: College of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China

Mathematics, 2022, vol. 10, issue 15, 1-12

Abstract: Cloud detection is an essential step in optical remote sensing data processing. With the development of deep learning technology, cloud detection methods have made remarkable progress. Among them, researchers have started to try to introduce Transformer into cloud detection tasks due to its excellent performance in image semantic segmentation tasks. However, the current Transformer-based methods suffer from training difficulty and low detection accuracy of small clouds. To solve these problems, this paper proposes Cloudformer V2 based on the previously proposed Cloudformer. For the training difficulty, Cloudformer V2 uses Set Attention Block to extract intermediate features as Set Prior Prediction to participate in supervision, which enables the model to converge faster. For the detection of small clouds, Cloudformer V2 decodes the features by a multi-scale Transformer decoder, which uses multi-resolution features to improve the modeling accuracy. In addition, a binary mask weighted loss function (BW Loss) is designed to construct weights by counting pixels classified as clouds; thus, guiding the network to focus on features of small clouds and improving the overall detection accuracy. Cloudformer V2 is experimented on the dataset from GF-1 satellite and has excellent performance.

Keywords: cloud detection; remote-sensing images; transformer (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2022
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/10/15/2710/pdf (application/pdf)
https://www.mdpi.com/2227-7390/10/15/2710/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:10:y:2022:i:15:p:2710-:d:877031

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().