Preserving Expert-Level Privacy in Offline Reinforcement Learning

The offline reinforcement learning (RL) problem aims to learn an optimal policy from historical data collected by one or more behavioural policies (experts) by interacting with an environment. However, the individual experts may be privacy-sensitive in that the learnt policy may retain information a...

Bibliographic details
Main authors: Sharma, Navodita; Vinod, Vishnu; Thakurta, Abhradeep; Agarwal, Alekh; Balle, Borja; Dann, Christoph; Raghuveer, Aravindan
Format: Article
Language: English