Designing Sun–Earth L2 Halo Orbit Stationkeeping Maneuvers via Reinforcement Learning

Reinforcement learning (RL) is used to design impulsive stationkeeping maneuvers for a spacecraft operating near an [Formula: see text] quasi-halo trajectory in a Sun–Earth–Moon point mass ephemeris model with solar radiation pressure. This scenario is translated into an RL problem that reflects the...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of guidance, control, and dynamics control, and dynamics, 2023-02, Vol.46 (2), p.301-311
Hauptverfasser: Bonasera, Stefano, Bosanac, Natasha, Sullivan, Christopher J., Elliott, Ian, Ahmed, Nisar, McMahon, Jay W.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Reinforcement learning (RL) is used to design impulsive stationkeeping maneuvers for a spacecraft operating near an [Formula: see text] quasi-halo trajectory in a Sun–Earth–Moon point mass ephemeris model with solar radiation pressure. This scenario is translated into an RL problem that reflects the desired stationkeeping goals, variables, and dynamical model. An algorithm from proximal policy optimization is used to train a policy that generates stationkeeping maneuvers while transfer learning is used to reduce the computational time required for training. The trained policy successfully generates stationkeeping maneuvers that result in boundedness to the vicinity of the selected reference trajectory with low total maneuver requirements, producing comparable results to a traditionally formulated constrained optimization scheme.
ISSN:0731-5090
1533-3884
DOI:10.2514/1.G006783