HYBRID ONLINE POLICY ADAPTATION STRATEGY FOR ATTITUDE POINTING PERFORMANCE

This specification relates to systems, methods and apparatus for controlling a satellite using machine-learning models, and the training of such machine-learning models. According to a first aspect of this specification there is described a computer implemented method for controlling a satellite com...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: WATT, Mark, LACHEVRE, Pierre, GARCÍA, Carlos Hervás, PASSARIN, Federico
Format: Patent
Sprache:eng ; fre ; ger
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator WATT, Mark
LACHEVRE, Pierre
GARCÍA, Carlos Hervás
PASSARIN, Federico
description This specification relates to systems, methods and apparatus for controlling a satellite using machine-learning models, and the training of such machine-learning models. According to a first aspect of this specification there is described a computer implemented method for controlling a satellite comprising: for at least one machine learning model for controlling at least the attitude of the satellite controlling a satellite for an episode of time using said machine-learning model. The controlling comprises: computing, by one or more processors of the satellite, a nominal control command for the satellite from data representing the current state of the satellite using a predefined nominal control model; generating, by the one or more processors and using said machine learning model, one or more corrections to said nominal control command using the current attitude state of the satellite and said nominal control command (5.3); generating, by the one or more processors, an effective control command by applying said correction to said first predefined nominal control command; and controlling the satellite based on said effective control. The method further comprises evaluating, by the one or more processors, a performance of the effective control commands for the episode using a reward function providing rewards (5.4); and updating, by the one or more processors, said machine learning model based on the rewards, wherein updates are determined using a metaheuristic optimisation algorithm (5.5).
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_EP4408749A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>EP4408749A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_EP4408749A13</originalsourceid><addsrcrecordid>eNrjZPDyiHQK8nRR8Pfz8fRzVQjw9_F0jlRwdHEMCHEM8fT3UwgOCXIMcXWPVHDzD1JwDAnxDAl1Aanz9Avx9HNXCHANAkr4Ovo5u_IwsKYl5hSn8kJpbgYFN9cQZw_d1IL8-NTigsTk1LzUknjXABMTAwtzE0tHQ2MilAAANlwtdQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>HYBRID ONLINE POLICY ADAPTATION STRATEGY FOR ATTITUDE POINTING PERFORMANCE</title><source>esp@cenet</source><creator>WATT, Mark ; LACHEVRE, Pierre ; GARCÍA, Carlos Hervás ; PASSARIN, Federico</creator><creatorcontrib>WATT, Mark ; LACHEVRE, Pierre ; GARCÍA, Carlos Hervás ; PASSARIN, Federico</creatorcontrib><description>This specification relates to systems, methods and apparatus for controlling a satellite using machine-learning models, and the training of such machine-learning models. According to a first aspect of this specification there is described a computer implemented method for controlling a satellite comprising: for at least one machine learning model for controlling at least the attitude of the satellite controlling a satellite for an episode of time using said machine-learning model. The controlling comprises: computing, by one or more processors of the satellite, a nominal control command for the satellite from data representing the current state of the satellite using a predefined nominal control model; generating, by the one or more processors and using said machine learning model, one or more corrections to said nominal control command using the current attitude state of the satellite and said nominal control command (5.3); generating, by the one or more processors, an effective control command by applying said correction to said first predefined nominal control command; and controlling the satellite based on said effective control. The method further comprises evaluating, by the one or more processors, a performance of the effective control commands for the episode using a reward function providing rewards (5.4); and updating, by the one or more processors, said machine learning model based on the rewards, wherein updates are determined using a metaheuristic optimisation algorithm (5.5).</description><language>eng ; fre ; ger</language><subject>AIRCRAFT ; AVIATION ; COSMONAUTICS ; PERFORMING OPERATIONS ; TRANSPORTING ; VEHICLES OR EQUIPMENT THEREFOR</subject><creationdate>2024</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20240807&amp;DB=EPODOC&amp;CC=EP&amp;NR=4408749A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76289</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20240807&amp;DB=EPODOC&amp;CC=EP&amp;NR=4408749A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>WATT, Mark</creatorcontrib><creatorcontrib>LACHEVRE, Pierre</creatorcontrib><creatorcontrib>GARCÍA, Carlos Hervás</creatorcontrib><creatorcontrib>PASSARIN, Federico</creatorcontrib><title>HYBRID ONLINE POLICY ADAPTATION STRATEGY FOR ATTITUDE POINTING PERFORMANCE</title><description>This specification relates to systems, methods and apparatus for controlling a satellite using machine-learning models, and the training of such machine-learning models. According to a first aspect of this specification there is described a computer implemented method for controlling a satellite comprising: for at least one machine learning model for controlling at least the attitude of the satellite controlling a satellite for an episode of time using said machine-learning model. The controlling comprises: computing, by one or more processors of the satellite, a nominal control command for the satellite from data representing the current state of the satellite using a predefined nominal control model; generating, by the one or more processors and using said machine learning model, one or more corrections to said nominal control command using the current attitude state of the satellite and said nominal control command (5.3); generating, by the one or more processors, an effective control command by applying said correction to said first predefined nominal control command; and controlling the satellite based on said effective control. The method further comprises evaluating, by the one or more processors, a performance of the effective control commands for the episode using a reward function providing rewards (5.4); and updating, by the one or more processors, said machine learning model based on the rewards, wherein updates are determined using a metaheuristic optimisation algorithm (5.5).</description><subject>AIRCRAFT</subject><subject>AVIATION</subject><subject>COSMONAUTICS</subject><subject>PERFORMING OPERATIONS</subject><subject>TRANSPORTING</subject><subject>VEHICLES OR EQUIPMENT THEREFOR</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2024</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZPDyiHQK8nRR8Pfz8fRzVQjw9_F0jlRwdHEMCHEM8fT3UwgOCXIMcXWPVHDzD1JwDAnxDAl1Aanz9Avx9HNXCHANAkr4Ovo5u_IwsKYl5hSn8kJpbgYFN9cQZw_d1IL8-NTigsTk1LzUknjXABMTAwtzE0tHQ2MilAAANlwtdQ</recordid><startdate>20240807</startdate><enddate>20240807</enddate><creator>WATT, Mark</creator><creator>LACHEVRE, Pierre</creator><creator>GARCÍA, Carlos Hervás</creator><creator>PASSARIN, Federico</creator><scope>EVB</scope></search><sort><creationdate>20240807</creationdate><title>HYBRID ONLINE POLICY ADAPTATION STRATEGY FOR ATTITUDE POINTING PERFORMANCE</title><author>WATT, Mark ; LACHEVRE, Pierre ; GARCÍA, Carlos Hervás ; PASSARIN, Federico</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_EP4408749A13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; fre ; ger</language><creationdate>2024</creationdate><topic>AIRCRAFT</topic><topic>AVIATION</topic><topic>COSMONAUTICS</topic><topic>PERFORMING OPERATIONS</topic><topic>TRANSPORTING</topic><topic>VEHICLES OR EQUIPMENT THEREFOR</topic><toplevel>online_resources</toplevel><creatorcontrib>WATT, Mark</creatorcontrib><creatorcontrib>LACHEVRE, Pierre</creatorcontrib><creatorcontrib>GARCÍA, Carlos Hervás</creatorcontrib><creatorcontrib>PASSARIN, Federico</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>WATT, Mark</au><au>LACHEVRE, Pierre</au><au>GARCÍA, Carlos Hervás</au><au>PASSARIN, Federico</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>HYBRID ONLINE POLICY ADAPTATION STRATEGY FOR ATTITUDE POINTING PERFORMANCE</title><date>2024-08-07</date><risdate>2024</risdate><abstract>This specification relates to systems, methods and apparatus for controlling a satellite using machine-learning models, and the training of such machine-learning models. According to a first aspect of this specification there is described a computer implemented method for controlling a satellite comprising: for at least one machine learning model for controlling at least the attitude of the satellite controlling a satellite for an episode of time using said machine-learning model. The controlling comprises: computing, by one or more processors of the satellite, a nominal control command for the satellite from data representing the current state of the satellite using a predefined nominal control model; generating, by the one or more processors and using said machine learning model, one or more corrections to said nominal control command using the current attitude state of the satellite and said nominal control command (5.3); generating, by the one or more processors, an effective control command by applying said correction to said first predefined nominal control command; and controlling the satellite based on said effective control. The method further comprises evaluating, by the one or more processors, a performance of the effective control commands for the episode using a reward function providing rewards (5.4); and updating, by the one or more processors, said machine learning model based on the rewards, wherein updates are determined using a metaheuristic optimisation algorithm (5.5).</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng ; fre ; ger
recordid cdi_epo_espacenet_EP4408749A1
source esp@cenet
subjects AIRCRAFT
AVIATION
COSMONAUTICS
PERFORMING OPERATIONS
TRANSPORTING
VEHICLES OR EQUIPMENT THEREFOR
title HYBRID ONLINE POLICY ADAPTATION STRATEGY FOR ATTITUDE POINTING PERFORMANCE
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T22%3A31%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=WATT,%20Mark&rft.date=2024-08-07&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EEP4408749A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true