Heterogeneous multi-Agent reinforcement learning algorithm integrating Prior-knowledge

In recent years, the breakthrough of machine learning based on deep reinforcement learning provides a new development direction for intelligent game confrontation. In order to solve the problems of slow convergence speed and great difference in training effect of heterogeneous multi-agent reinforcem...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Zhihui Kongzhi Yu Fangzhen 2023-06, Vol.45 (3), p.99-107
1. Verfasser:	ZHOU Jiawei, SUN Yuxiang, XUE Yufan, XIANG Qi, WU Ying, ZHOU Xianzhong
Format:	Artikel
Sprache:	chi
Schlagworte:	reinforcement learning\|intelligent game\|wargaming\|maddpg\|multi-agent collaboration
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In recent years, the breakthrough of machine learning based on deep reinforcement learning provides a new development direction for intelligent game confrontation. In order to solve the problems of slow convergence speed and great difference in training effect of heterogeneous multi-agent reinforcement learning algorithm in intelligent confrontation, this paper proposes a priori knowledge-driven multi-agent reinforcement learning game antagonism algorithm PK-MADDPG, and constructs a MADDPG model under the framework of double Critic. The model uses the experience first replay technique to optimize the prior knowledge extraction, thus achieving remarkable results in the training of game confrontation. In the national competition of MaCA heterogeneous multi-agent game confrontation, the paper compares the game confrontation results of PK-MADDPG algorithm with classical rule algorithm, and verifies the effectiveness of the algorithm proposed in this paper.
ISSN:	1673-3819
DOI:	10.3969/j.issn.1673-3819.2023.03.015