Efficient Federated Learning for Metaverse via Dynamic User Selection, Gradient Quantization and Resource Allocation

Bibliographic details
Published in:	IEEE Journal on Selected Areas in Communications, 2024-04, Vol. 42 (4), pp. 850-866
Main authors:	Hou, Xiangwang; Wang, Jingjing; Jiang, Chunxiao; Meng, Zezhao; Chen, Jianrui; Ren, Yong
Format:	Article
Language:	English
Subjects:	Convergence; Energy consumption; Federated learning; Gradient compression; Iterative methods; Markov processes; Metaverse; Optimization; Quantization (signal); Reinforcement learning; Resource allocation; Resource management; Training; Transmission error; User experience; User requirements; Wireless communications

Description
Abstract:	The metaverse is envisioned to merge the actual world with a virtual world to bring users unprecedented immersive experiences. To ensure user experience, federated learning (FL) is expected to be a critical enabler that provides metaverse users with high-quality sensing, communication, and rendering. However, given the limited wireless communication resources and the stringent requirements of users, realizing FL across massive numbers of metaverse users still faces tremendous challenges. Most pioneering works on improving the performance of FL assume that the system states are static, which is unsuitable for the metaverse. Because FL in the metaverse is a complicated long-term iterative process in which fluctuations in channel status and in users' available computing resources are inevitable, a fixed strategy may lead to poor results. Therefore, this paper proposes an efficient FL scheme relying on dynamic user selection, gradient quantization, and resource allocation. Specifically, we derive the convergence error bound to reveal the impact of user selection, wireless transmission error, and gradient quantization error in each iteration on FL's convergence. Based on the theoretical analysis, we jointly and dynamically optimize the user selection, gradient quantization, and resource allocation to minimize the error bound under time and energy consumption budgets. Furthermore, to make the formulated sequential decision-making problem tractable, we transform it into a Markov decision process and design a soft actor-critic-based solution. Extensive experiments validate that our proposed scheme outperforms conventional schemes in dynamically changing network environments.
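Note:	The abstract refers to gradient quantization error without giving the quantizer itself. As a rough illustration only, the sketch below shows a generic stochastic uniform quantizer of the kind often used in FL gradient compression. It is an assumption for illustration, not the scheme from the paper; the function name `stochastic_quantize` and the `bits` parameter are hypothetical.

```python
import random


def stochastic_quantize(grad, bits, rng=None):
    """Quantize a list of floats to 2**bits evenly spaced levels.

    Illustrative only (not the paper's quantizer). Each entry is rounded
    to one of its two neighbouring levels at random, with probabilities
    chosen so the quantized value equals the original in expectation.
    """
    rng = rng or random.Random()
    lo, hi = min(grad), max(grad)
    if hi == lo:
        # All entries are identical, so there is nothing to quantize.
        return list(grad)

    intervals = 2 ** bits - 1            # number of gaps between levels
    step = (hi - lo) / intervals         # spacing between adjacent levels

    quantized = []
    for g in grad:
        pos = (g - lo) / step            # position measured in steps
        lower = int(pos)                 # index of the level just below
        frac = pos - lower               # distance to that level, in [0, 1)
        # Round up with probability `frac`, otherwise round down.
        idx = lower + (1 if rng.random() < frac else 0)
        idx = min(idx, intervals)        # guard against float overshoot
        quantized.append(lo + idx * step)
    return quantized
```

Fewer bits shrink the payload each user uploads but enlarge the per-entry error (at most one step), which is the kind of trade-off the abstract describes optimizing jointly with user selection and resource allocation.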
ISSN:	0733-8716 1558-0008
DOI:	10.1109/JSAC.2023.3345393