Dynamic Mirror Descent based Model Predictive Control for Accelerating Robot Learning

Recent works in Reinforcement Learning (RL) combine model-free (Mf)-RL algorithms with model-based (Mb)-RL approaches to get the best from both: asymptotic performance of Mf-RL and high sample-efficiency of Mb-RL. Inspired by these works, we propose a hierarchical framework that integrates online le...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Mishra, Utkarsh A, Samineni, Soumya R, Goel, Prakhar, Kunjeti, Chandravaran, Lodha, Himanshu, Singh, Aman, Sagi, Aditya, Bhatnagar, Shalabh, Kolathaya, Shishir
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!