User-Oriented Robust Reinforcement Learning

Recently, improving the robustness of policies across different environments attracts increasing attention in the reinforcement learning (RL) community. Existing robust RL methods mostly aim to achieve the max-min robustness by optimizing the policy's performance in the worst-case environment....

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	arXiv.org 2022-12
Hauptverfasser:	You, Haoyi, Yu, Beichen, Jin, Haiming, Yang, Zhaoxing, Sun, Jiahui
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Learning Optimization Performance measurement Policies Preferences Robustness
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Schreiben Sie den ersten Kommentar!