Macro Action Reinforcement Learning with Sequence Disentanglement using Variational Autoencoder
One problem in the application of reinforcement learning to real-world problems is the curse of dimensionality on the action space. Macro actions, a sequence of primitive actions, have been studied to diminish the dimensionality of the action space with regard to the time axis. However, previous stu...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | One problem in the application of reinforcement learning to real-world
problems is the curse of dimensionality on the action space. Macro actions, a
sequence of primitive actions, have been studied to diminish the dimensionality
of the action space with regard to the time axis. However, previous studies
relied on humans defining macro actions or assumed macro actions as repetitions
of the same primitive actions. We present Factorized Macro Action Reinforcement
Learning (FaMARL) which autonomously learns disentangled factor representation
of a sequence of actions to generate macro actions that can be directly applied
to general reinforcement learning algorithms. FaMARL exhibits higher scores
than other reinforcement learning algorithms on environments that require an
extensive amount of search. |
---|---|
DOI: | 10.48550/arxiv.1903.09366 |