EconPapers    
Economics at your fingertips  
 

FedISM: Enhancing Data Imbalance via Shared Model in Federated Learning

Wu-Chun Chung (), Yan-Hui Lin and Sih-Han Fang
Additional contact information
Wu-Chun Chung: Department of Information and Computer Engineering, Chung Yuan Christian University, Taoyuan 320, Taiwan
Yan-Hui Lin: Department of Information and Computer Engineering, Chung Yuan Christian University, Taoyuan 320, Taiwan
Sih-Han Fang: Department of Information and Computer Engineering, Chung Yuan Christian University, Taoyuan 320, Taiwan

Mathematics, 2023, vol. 11, issue 10, 1-22

Abstract: Considering the sensitivity of data in medical scenarios, federated learning (FL) is suitable for applications that require data privacy. Medical personnel can use the FL framework for machine learning to assist in analyzing large-scale data that are protected within the institution. However, not all clients have the same distribution of datasets, so data imbalance problems occur among clients. The main challenge is to overcome the performance degradation caused by low accuracy and the inability to converge the model. This paper proposes a FedISM method to enhance performance in the case of Non-Independent Identically Distribution (Non-IID). FedISM exploits a shared model trained on a candidate dataset before performing FL among clients. The Candidate Selection Mechanism (CSM) was proposed to effectively select the most suitable candidate among clients for training the shared model. Based on the proposed approaches, FedISM not only trains the shared model without sharing any raw data, but it also provides an optimal solution through the selection of the best shared model. To evaluate performance, the proposed FedISM was applied to classify coronavirus disease (COVID), pneumonia, normal, and viral pneumonia in the experiments. The Dirichlet process was also used to simulate a variety of imbalanced data distributions. Experimental results show that FedISM improves accuracy by up to 25% when privacy concerns regarding patient data are rising among medical institutions.

Keywords: federated learning; shared model; data imbalance; Non-IID; COVID-19 (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/11/10/2385/pdf (application/pdf)
https://www.mdpi.com/2227-7390/11/10/2385/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:11:y:2023:i:10:p:2385-:d:1151734

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jmathe:v:11:y:2023:i:10:p:2385-:d:1151734