FedMDS: An Efficient Model Discrepancy-Aware Semi-asynchronous Clustered Federated Learning Framework

Federated learning (FL) is an emerging distributed machine learning paradigm that protects privacy and tackles the problem of isolated data islands. At present, there are two main communication strategies of FL: synchronous FL and asynchronous FL. The advantages of synchronous FL are that the model...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on parallel and distributed systems 2023-03, Vol.34 (3), p.1-14
Hauptverfasser: Zhang, Yu, Liu, Duo, Duan, Moming, Li, Li, Chen, Xianzhang, Ren, Ao, Tan, Yujuan, Wang, Chengliang
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Federated learning (FL) is an emerging distributed machine learning paradigm that protects privacy and tackles the problem of isolated data islands. At present, there are two main communication strategies of FL: synchronous FL and asynchronous FL. The advantages of synchronous FL are that the model has high precision and converges easily. However, this synchronous communication strategy has the risk that the central server waits too long for the devices, namely, the straggler effect. Asynchronous FL has a natural advantage in mitigating the straggler effect, but there are threats of model quality degradation and server crash. In this paper, we demonstrate the negative impact of the asynchronous update mechanism on the accuracy of the model and the phenomenon that the client waits too long due to the existence of stragglers in the synchronous update mechanism. Therefore, we propose a model discrepancy-aware semi-asynchronous clustered federated learning framework ( FedMDS ), which alleviates the straggler effect by 1) a clustered strategy based on the delay and direction of the model update. 2) a synchronous trigger mechanism that limits the model staleness. FedMDS leverages the clustered algorithm to reschedule the clients, and each group of clients performs asynchronous updates until the synchronous update mechanism based on the model discrepancy is triggered. We evaluate FedMDS based on four federated datasets in a non-IID setting and compare FedMDS to the baseline methods. The experimental results show that FedMDS significantly improves average test accuracy by more than +9.2\% on the four datasets compared to TA-FedAvg . In particular, FedMDS improves absolute Top-1 test accuracy by +37.6\% on FEMNIST compared to TA-FedAvg . The frequency of the average synchronization waiting time (ASWT) of FedMDS is significantly lower than that of T-FedAvg on four datasets. The experiment results show that FedMDS can improve the accuracy and alleviate the straggler effect.
ISSN:1045-9219
1558-2183
DOI:10.1109/TPDS.2023.3237752