Enhancing deep reinforcement learning for scale flexibility in real-time strategy games

Bibliographic details
Published in:	Entertainment computing 2025-01, Vol.52, p.100843, Article 100843
Main authors:	Lemos, Marcelo Luiz Harry Diniz; Vieira, Ronaldo e Silva; Tavares, Anderson Rocha; Marcolino, Leandro Soriano; Chaimowicz, Luiz
Format:	Article
Language:	English
Subjects:	Deep Learning; Game-playing AI; Real-Time Strategy Games; Reinforcement learning
Online access:	Full text

Description
Abstract:	Real-time strategy (RTS) games present a unique challenge for AI agents due to the combination of several fundamental AI problems. While Deep Reinforcement Learning (DRL) has shown promise in the development of autonomous agents for the genre, existing architectures often struggle with games featuring maps of varying dimensions. This limitation hinders the agent’s ability to generalize its learned strategies across different scenarios. This paper proposes a novel approach that overcomes this problem by incorporating Spatial Pyramid Pooling (SPP) within a DRL framework. We leverage the GridNet architecture’s encoder–decoder structure and integrate an SPP layer into the critic network of the Proximal Policy Optimization (PPO) algorithm. This SPP layer dynamically generates a standardized representation of the game state, regardless of the initial observation size. This allows the agent to effectively adapt its decision-making process to any map configuration. Our evaluations demonstrate that the proposed method significantly enhances the model’s flexibility and efficiency in training agents for various RTS game scenarios, albeit with some discernible limitations when applied to very small maps. This approach paves the way for more robust and adaptable AI agents capable of excelling in sequential decision problems with variable-size observations.
• Scale-invariant architecture for DRL in RTS Games.
• Enhanced agent’s flexibility and generalization across diverse scenarios.
• Reduced training time through transfer learning.
• Robust training by combining multiple map sizes.
• Experiments on Gym-μRTS and Frozen Lake environments.
ISSN:	1875-9521, 1875-953X
DOI:	10.1016/j.entcom.2024.100843
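
The abstract describes a Spatial Pyramid Pooling (SPP) layer that turns an observation of any size into a standardized representation. The following is a minimal NumPy sketch of that pooling idea only, not the authors' implementation: the function name `spatial_pyramid_pool`, the pyramid levels `(1, 2, 4)`, the channel count, and the map sizes are all assumptions chosen for illustration, and the sketch does not reproduce the GridNet encoder–decoder or the PPO critic.

```python
import numpy as np


def _ceil_div(a, b):
    """Integer ceiling division for non-negative a and positive b."""
    return -(-a // b)


def spatial_pyramid_pool(feature_map, levels=(1, 2, 4)):
    """Max-pool a (C, H, W) feature map into a fixed-length vector.

    For each pyramid level n, the map is divided into an n x n grid of
    bins. Bin edges use floor/ceil so every bin is non-empty and the
    bins together cover the whole map, even when H or W is not
    divisible by n. The output length is C * sum(n * n for n in levels),
    independent of H and W.
    """
    channels, height, width = feature_map.shape
    pooled = []
    for n in levels:
        for i in range(n):
            r0 = (i * height) // n
            r1 = _ceil_div((i + 1) * height, n)
            for j in range(n):
                c0 = (j * width) // n
                c1 = _ceil_div((j + 1) * width, n)
                # Max over the spatial bin, one value per channel.
                pooled.append(feature_map[:, r0:r1, c0:c1].max(axis=(1, 2)))
    return np.concatenate(pooled)


# Hypothetical feature maps for two different map sizes (illustrative values).
rng = np.random.default_rng(0)
small = spatial_pyramid_pool(rng.random((8, 8, 8)))
large = spatial_pyramid_pool(rng.random((8, 16, 24)))
print(small.shape, large.shape)
```

Both calls return a vector of the same length (8 channels × (1 + 4 + 16) bins), which is what would let a fully connected head downstream accept maps of different dimensions without changes.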