Research on privacy protection of multi source data based on improved gbdt federated ensemble method with different metrics

Federated learning is a hot topic in the field of multi-source data. It has the advantage that the data cannot be localized, and it has the problems that the parameters of the local model are difficult to integrate and the accuracy of the model is low. In this paper, an improved gradient boosting de...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Physical communication 2021-12, Vol.49, p.101347, Article 101347
Hauptverfasser:	Luo, Changyin, Chen, Xuebin, Xu, Jingcheng, Zhang, Shufen
Format:	Artikel
Sprache:	eng
Schlagworte:	Engineering Engineering, Electrical & Electronic Federated ensemble method Federated learning GBDT Science & Technology Similar samples Technology Telecommunications
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Federated learning is a hot topic in the field of multi-source data. It has the advantage that the data cannot be localized, and it has the problems that the parameters of the local model are difficult to integrate and the accuracy of the model is low. In this paper, an improved gradient boosting decision tree (GBDT) federated ensemble learning method is proposed, which takes the average gradient of similar samples and its own gradient as a new gradient to improve the accuracy of the local model. Different ensemble learning methods are used to integrate the parameters of the local model, thus improving the accuracy of the updated global model. The experimental results show that the accuracy of the global model trained by the method on magic-gamma-telescope datasets and eeg-eye-state datasets is compared with that of GBDT federated ensemble method and traditional multi-source data processing technology without finding similar samples. The accuracy of the model is improved by 0.6%, 0.45% and 0.27% and 0.08% respectively, which makes it possible to improve the security of data and model while improving the accuracy of the model.
ISSN:	1874-4907 1876-3219
DOI:	10.1016/j.phycom.2021.101347