MACHINE LEARNING TECHNIQUES FOR REINFORCEMENT LEARNING USING CROSS-SUPPORT LIKELIHOOD MODEL SIMILARITY DETERMINATIONS

Various embodiments of the present invention introduce technical advantages related to computational efficiency and storage efficiency of training reinforcement learning models using model-based reinforcement learning approaches. For example, various embodiments of the present invention enable train...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Hussain, Reem A, Patel, Yagnesh J, Nori, Vijay S
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Various embodiments of the present invention introduce technical advantages related to computational efficiency and storage efficiency of training reinforcement learning models using model-based reinforcement learning approaches. For example, various embodiments of the present invention enable training components of a dynamics model of a reinforcement learning framework using cross-space likelihood similarity measures between predicted transition likelihood models and empirical transition likelihood models even when the two noted likelihood models have distinct distribution supports. This enables using training/empirical observation data to train dynamics model components even when the output state spaces of the dynamics model components are distinct from the output state space of the empirical distributions determined using the training/empirical observation data.