Onboard fuel-optimal guidance for human-Mars entry, powered-descent, and landing mission based on feature learning
This paper investigates the end-to-end human-Mars entry, powered-descent, and landing (EDL) guidance problem by developing an on-board learning-based optimal control method (L-OCM) to achieve the goal of precise and fuel-efficient planetary landing. First, the end-to-end EDL guidance problem is form...
Gespeichert in:
Veröffentlicht in: | Acta astronautica 2022-06, Vol.195, p.129-144 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | This paper investigates the end-to-end human-Mars entry, powered-descent, and landing (EDL) guidance problem by developing an on-board learning-based optimal control method (L-OCM) to achieve the goal of precise and fuel-efficient planetary landing. First, the end-to-end EDL guidance problem is formulated as a multi-phase optimal control problem with hybrid dynamics and constraints. Then a customized alternating direction method of multipliers is applied to solve the end-to-end EDL guidance problem with varying initial states off-line. After that, the L-OCM is developed to generate real-time optimal guidance commands. To be specific, supported by the optimal control theory, the necessary conditions of optimality for optimal control of the entry phase and powered-descent phase are derived, respectively, which leads to two two-point-boundary-value-problems (TPBVPs). Then, critical parameters are identified to approximate the complete solutions of the TPBVPs. To find the implicit relationship between the initial states and these critical parameters, deep neural networks are constructed to learn the values of these critical parameters in real-time with training data obtained from the off-line solutions. Furthermore, when random disturbances are considered during the EDL process, the single stage L-OCM is extended to the multi-stage L-OCM to regenerate real-time optimal guidance commands. Finally, the proposed L-OCM is implemented in extensive simulation cases to verify the effectiveness and efficiency of the new method.
•We consider the multi-phase guidance problem integrating the entry and powered descent phases.•The learning method only needs to learn identified critical parameters to reduce computation load.•The proposed approach can handle multi-type disturbances during the EDL mission.•The proposed method can achieve precise and fuel-efficient planetary landing in real-time. |
---|---|
ISSN: | 0094-5765 1879-2030 |
DOI: | 10.1016/j.actaastro.2022.02.007 |