Deep Reinforcement Learning: A Survey

Deep reinforcement learning (DRL) integrates the feature representation ability of deep learning with the decision-making ability of reinforcement learning so that it can achieve powerful end-to-end learning control capabilities. In the past decade, DRL has made substantial advances in many tasks th...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transaction on neural networks and learning systems 2024-04, Vol.35 (4), p.5064-5078
Hauptverfasser:	Wang, Xu, Wang, Sen, Liang, Xingxing, Zhao, Dawei, Huang, Jincai, Xu, Xin, Dai, Bin, Miao, Qiguang
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Behavioral sciences Control tasks Decision making Decision theory Deep learning deep reinforcement learning (DRL) Dynamic programming imitation learning Mathematical models Maximum entropy maximum entropy deep reinforcement learning (RL) Multiagent systems Observational learning policy gradient Q-learning Reinforcement Task analysis Trajectory value function
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Deep reinforcement learning (DRL) integrates the feature representation ability of deep learning with the decision-making ability of reinforcement learning so that it can achieve powerful end-to-end learning control capabilities. In the past decade, DRL has made substantial advances in many tasks that require perceiving high-dimensional input and making optimal or near-optimal decisions. However, there are still many challenging problems in the theory and applications of DRL, especially in learning control tasks with limited samples, sparse rewards, and multiple agents. Researchers have proposed various solutions and new theories to solve these problems and promote the development of DRL. In addition, deep learning has stimulated the further development of many subfields of reinforcement learning, such as hierarchical reinforcement learning (HRL), multiagent reinforcement learning, and imitation learning. This article gives a comprehensive overview of the fundamental theories, key algorithms, and primary research domains of DRL. In addition to value-based and policy-based DRL algorithms, the advances in maximum entropy-based DRL are summarized. The future research topics of DRL are also analyzed and discussed.
ISSN:	2162-237X 2162-2388
DOI:	10.1109/TNNLS.2022.3207346