Accelerating Deep Reinforcement Learning via Phase-Level Parallelism for Robotics Applications

Bibliographic Details
Published in: IEEE Computer Architecture Letters, 2024-01, Vol. 23 (1), pp. 41-44
Authors: Kim, Yang-Gon, Han, Yun-Ki, Shin, Jae-Kang, Kim, Jun-Kyum, Kim, Lee-Sup
Format: Article
Language: English
Description
Abstract: Deep Reinforcement Learning (DRL) plays a critical role in controlling future intelligent machines such as robots and drones. Constantly retrained on newly arriving real-world data, DRL provides optimal autonomous control solutions that adapt to ever-changing environments. However, DRL repeatedly alternates between inference and training, both of which are computationally expensive on resource-constrained mobile/embedded platforms. Worse, DRL suffers from severe hardware underutilization due to its unique execution pattern. To overcome this inefficiency, we propose Train Early Start, a new execution pattern for efficient DRL. Train Early Start parallelizes inference and training, hiding the serialized performance bottleneck and dramatically improving hardware utilization. Compared to a state-of-the-art mobile SoC, Train Early Start achieves a 1.42x speedup and 1.13x higher energy efficiency.
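The abstract describes overlapping the inference (environment-interaction) phase with the training phase instead of running them back to back. As a rough illustration of that general idea only, the Python sketch below runs an actor thread and a learner thread concurrently over a shared replay buffer; the names (ReplayBuffer, actor_loop, learner_loop) and the threading scheme are hypothetical and do not reproduce the paper's Train Early Start mechanism.

```python
# Minimal sketch: overlapping DRL inference and training with two threads.
# Illustrative assumptions only; not the paper's Train Early Start design.
import random
import threading
import time

class ReplayBuffer:
    """Thread-safe FIFO buffer shared by the inference and training threads."""
    def __init__(self, capacity=10_000):
        self._data = []
        self._lock = threading.Lock()
        self._capacity = capacity

    def add(self, transition):
        with self._lock:
            self._data.append(transition)
            if len(self._data) > self._capacity:
                self._data.pop(0)

    def sample(self, batch_size):
        with self._lock:
            if len(self._data) < batch_size:
                return None
            return random.sample(self._data, batch_size)

def actor_loop(buffer, stop):
    """Inference phase: interact with the environment and store transitions."""
    state = 1.0
    while not stop.is_set():
        action = state * 0.5                # stand-in for a policy forward pass
        next_state = state + action         # stand-in for env.step()
        reward = -abs(state)
        buffer.add((state, action, reward, next_state))
        state = next_state if abs(next_state) < 10 else 1.0

def learner_loop(buffer, stop, batch_size=32):
    """Training phase: runs concurrently instead of waiting for a full rollout."""
    while not stop.is_set():
        batch = buffer.sample(batch_size)
        if batch is None:
            time.sleep(0.001)               # buffer not warm yet; back off briefly
            continue
        _loss = sum(r for (_, _, r, _) in batch) / batch_size  # stand-in gradient step

if __name__ == "__main__":
    buf, stop = ReplayBuffer(), threading.Event()
    threads = [threading.Thread(target=actor_loop, args=(buf, stop)),
               threading.Thread(target=learner_loop, args=(buf, stop))]
    for t in threads:
        t.start()
    time.sleep(1.0)                         # let both phases overlap briefly
    stop.set()
    for t in threads:
        t.join()
```

In the serialized baseline the learner would sit idle during every rollout; here both loops make progress at once, which is the utilization gain the abstract attributes to phase-level parallelism.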
ISSN: 1556-6056, 1556-6064
DOI: 10.1109/LCA.2023.3341152