Underwater Image Enhancement with a Hybrid U-Net-Transformer and Recurrent Multi-Scale Modulation
Zaiming Geng,
Jiabin Huang,
Xiaotian Wang (),
Yu Zhang,
Xinnan Fan and
Pengfei Shi
Additional contact information
Zaiming Geng: China Yangtze Power Co., Ltd., Yichang 443002, China
Jiabin Huang: State Grid Hangzhou Power Supply Company, Hangzhou 310016, China
Xiaotian Wang: College of Information Science and Engineering, Hohai University, Changzhou 213200, China
Yu Zhang: College of Artificial Intelligence and Automation, Hohai University, Changzhou 213200, China
Xinnan Fan: College of Information Science and Engineering, Hohai University, Changzhou 213200, China
Pengfei Shi: College of Artificial Intelligence and Automation, Hohai University, Changzhou 213200, China
Mathematics, 2025, vol. 13, issue 21, 1-16
Abstract:
The quality of underwater imagery is inherently degraded by light absorption and scattering, a challenge that severely limits its application in critical domains such as marine robotics and archeology. While existing enhancement methods, including recent hybrid models, attempt to address this, they often struggle to restore fine-grained details without introducing visual artifacts. To overcome this limitation, this work introduces a novel hybrid U-Net-Transformer (UTR) architecture that synergizes local feature extraction with global context modeling. The core innovation is a Recurrent Multi-Scale Feature Modulation (R-MSFM) mechanism, which, unlike prior recurrent refinement techniques, employs a gated modulation strategy across multiple feature scales within the decoder to iteratively refine textural and structural details with high fidelity. This approach effectively preserves spatial information during upsampling. Extensive experiments demonstrate the superiority of the proposed method. On the EUVP dataset, UTR achieves a PSNR of 28.347 dB, a significant gain of +3.947 dB over the state-of-the-art UWFormer. Moreover, it attains a top-ranking UIQM score of 3.059 on the UIEB dataset, underscoring its robustness. The results confirm that UTR provides a computationally efficient and highly effective solution for underwater image enhancement.
Keywords: underwater image enhancement; hybrid U-Net-Transformer; recurrent multi-scale feature modulation (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2227-7390/13/21/3398/pdf (application/pdf)
https://www.mdpi.com/2227-7390/13/21/3398/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:13:y:2025:i:21:p:3398-:d:1779415
Access Statistics for this article
Mathematics is currently edited by Ms. Emma He
More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().