A comparison between UCB and UCB-Tuned as selection policies in GGP

In this paper, we present a comparative analysis of two selection policies in the General Game Playing (GGP) context: Upper Confidence Bound (UCB) and Upper Confidence Bound Tuned (UCB-Tuned). The aim of the analysis is to identify which policy has the best performance in terms of victories in the G...

Bibliographic details
Published in:	Journal of intelligent & fuzzy systems 2019-01, Vol.36 (5), p.5073-5079
Main authors:	Francisco-Valencia, Iván; Marcial-Romero, José Raymundo; Valdovinos-Rosas, Rosa María
Format:	Article
Language:	English
Subjects:	Computer simulation; Policies