EconPapers    
Economics at your fingertips  
 

Progressively Multi-Scale Feature Fusion for Image Inpainting

Wu Wen, Tianhao Li, Amr Tolba (), Ziyi Liu and Kai Shao
Additional contact information
Wu Wen: School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
Tianhao Li: School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
Amr Tolba: Computer Science Department, Community College, King Saud University, Riyadh 11437, Saudi Arabia
Ziyi Liu: School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
Kai Shao: School of Software, Dalian University of Technology, Dalian 116024, China

Mathematics, 2023, vol. 11, issue 24, 1-20

Abstract: The rapid advancement of Wise Information Technology of med (WITMED) has made the integration of traditional Chinese medicine tongue diagnosis and computer technology an increasingly significant area of research. The doctor obtains patient’s tongue images to make a further diagnosis. However, the tongue image may be broken during the process of collecting the tongue image. Due to the extremely complex texture of the tongue and significant individual differences, existing methods fail to fully obtain sufficient feature information, which result in inaccurate inpainted tongue images. To address this problem, we propose a recurrent tongue image inpainting algorithm based on multi-scale feature fusion called Multi-Scale Fusion Module and Recurrent Attention Mechanism Network (MSFM-RAM-Net). We first propose Multi-Scale Fusion Module (MSFM), which preserves the feature information of tongue images at different scales and enhances the consistency between structures. To simultaneously accelerate the inpainting process and enhance the quality of the inpainted results, Recurrent Attention Mechanism (RAM) is proposed. RAM focuses the network’s attention on important areas and uses known information to gradually inpaint image, which can avoid redundant feature information and the problem of texture confusion caused by large missing areas. Finally, we establish a tongue image dataset and use this dataset to qualitatively and quantitatively evaluate the MSFM-RAM-Net. The results shows that the MSFM-RAM-Net has a better effect on tongue image inpainting, with PSNR and SSIM increasing by 2.1% and 3.3%, respectively.

Keywords: tongue image inpainting; MSFM; RAM; tongue image dataset (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2023
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/11/24/4908/pdf (application/pdf)
https://www.mdpi.com/2227-7390/11/24/4908/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:11:y:2023:i:24:p:4908-:d:1296888

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jmathe:v:11:y:2023:i:24:p:4908-:d:1296888