Deep reinforcement learning based models for hard-exploration problems

A self-driving vehicle implements a deep reinforcement learning based model. The self-driving vehicle comprise one or more sensors configured to capture sensor data of an environment of the self-driving vehicle, a control system configured to navigate the self-driving vehicle, and a controller to de...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Huizinga, Joost, Lehman, Joel Anthony, Stanley, Kenneth Owen, Ecoffet, Adrien Lucas, Clune, Jeffrey Michael
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Huizinga, Joost
Lehman, Joel Anthony
Stanley, Kenneth Owen
Ecoffet, Adrien Lucas
Clune, Jeffrey Michael
description A self-driving vehicle implements a deep reinforcement learning based model. The self-driving vehicle comprise one or more sensors configured to capture sensor data of an environment of the self-driving vehicle, a control system configured to navigate the self-driving vehicle, and a controller to determine and provide instructions to the control system. The controller implements a deep reinforcement learning based model that inputs the sensor data captured by the sensors to determine actions to perform by the control system. The model includes an archive storing states reachable by an agent in a training environment, each state stored in the archive is associated with a trajectory for reaching the state. The archive is generated by visiting states stored in the archive and performing actions to explore and find new states. New states are stored in the archive with their trajectories.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US11829870B2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US11829870B2</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US11829870B23</originalsourceid><addsrcrecordid>eNqNyjsOwjAQRVE3FAjYw7CASCQUhJZPRA_U0SR-CZZsjzV2wfKhYAFUtzh3aboLkEjh4iQ6IiAW8mCNLs40cIalIBY-09fpxWorvJMX5eIkUlIZPEJem8XEPmPz68psu-vjfKuQpEdOPCKi9M97XbfNsT3sTs3-n-cDYAk0Nw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Deep reinforcement learning based models for hard-exploration problems</title><source>esp@cenet</source><creator>Huizinga, Joost ; Lehman, Joel Anthony ; Stanley, Kenneth Owen ; Ecoffet, Adrien Lucas ; Clune, Jeffrey Michael</creator><creatorcontrib>Huizinga, Joost ; Lehman, Joel Anthony ; Stanley, Kenneth Owen ; Ecoffet, Adrien Lucas ; Clune, Jeffrey Michael</creatorcontrib><description>A self-driving vehicle implements a deep reinforcement learning based model. The self-driving vehicle comprise one or more sensors configured to capture sensor data of an environment of the self-driving vehicle, a control system configured to navigate the self-driving vehicle, and a controller to determine and provide instructions to the control system. The controller implements a deep reinforcement learning based model that inputs the sensor data captured by the sensors to determine actions to perform by the control system. The model includes an archive storing states reachable by an agent in a training environment, each state stored in the archive is associated with a trajectory for reaching the state. The archive is generated by visiting states stored in the archive and performing actions to explore and find new states. New states are stored in the archive with their trajectories.</description><language>eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE ORDIFFERENT FUNCTION ; CONTROL OR REGULATING SYSTEMS IN GENERAL ; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES ; CONTROLLING ; COUNTING ; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS ; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS ORELEMENTS ; PERFORMING OPERATIONS ; PHYSICS ; REGULATING ; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TOTHE CONTROL OF A PARTICULAR SUB-UNIT ; SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES ; TRANSPORTING ; VEHICLES IN GENERAL</subject><creationdate>2023</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20231128&amp;DB=EPODOC&amp;CC=US&amp;NR=11829870B2$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76290</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20231128&amp;DB=EPODOC&amp;CC=US&amp;NR=11829870B2$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Huizinga, Joost</creatorcontrib><creatorcontrib>Lehman, Joel Anthony</creatorcontrib><creatorcontrib>Stanley, Kenneth Owen</creatorcontrib><creatorcontrib>Ecoffet, Adrien Lucas</creatorcontrib><creatorcontrib>Clune, Jeffrey Michael</creatorcontrib><title>Deep reinforcement learning based models for hard-exploration problems</title><description>A self-driving vehicle implements a deep reinforcement learning based model. The self-driving vehicle comprise one or more sensors configured to capture sensor data of an environment of the self-driving vehicle, a control system configured to navigate the self-driving vehicle, and a controller to determine and provide instructions to the control system. The controller implements a deep reinforcement learning based model that inputs the sensor data captured by the sensors to determine actions to perform by the control system. The model includes an archive storing states reachable by an agent in a training environment, each state stored in the archive is associated with a trajectory for reaching the state. The archive is generated by visiting states stored in the archive and performing actions to explore and find new states. New states are stored in the archive with their trajectories.</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE ORDIFFERENT FUNCTION</subject><subject>CONTROL OR REGULATING SYSTEMS IN GENERAL</subject><subject>CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES</subject><subject>CONTROLLING</subject><subject>COUNTING</subject><subject>FUNCTIONAL ELEMENTS OF SUCH SYSTEMS</subject><subject>MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS ORELEMENTS</subject><subject>PERFORMING OPERATIONS</subject><subject>PHYSICS</subject><subject>REGULATING</subject><subject>ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TOTHE CONTROL OF A PARTICULAR SUB-UNIT</subject><subject>SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES</subject><subject>TRANSPORTING</subject><subject>VEHICLES IN GENERAL</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2023</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNyjsOwjAQRVE3FAjYw7CASCQUhJZPRA_U0SR-CZZsjzV2wfKhYAFUtzh3aboLkEjh4iQ6IiAW8mCNLs40cIalIBY-09fpxWorvJMX5eIkUlIZPEJem8XEPmPz68psu-vjfKuQpEdOPCKi9M97XbfNsT3sTs3-n-cDYAk0Nw</recordid><startdate>20231128</startdate><enddate>20231128</enddate><creator>Huizinga, Joost</creator><creator>Lehman, Joel Anthony</creator><creator>Stanley, Kenneth Owen</creator><creator>Ecoffet, Adrien Lucas</creator><creator>Clune, Jeffrey Michael</creator><scope>EVB</scope></search><sort><creationdate>20231128</creationdate><title>Deep reinforcement learning based models for hard-exploration problems</title><author>Huizinga, Joost ; Lehman, Joel Anthony ; Stanley, Kenneth Owen ; Ecoffet, Adrien Lucas ; Clune, Jeffrey Michael</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US11829870B23</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2023</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE ORDIFFERENT FUNCTION</topic><topic>CONTROL OR REGULATING SYSTEMS IN GENERAL</topic><topic>CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES</topic><topic>CONTROLLING</topic><topic>COUNTING</topic><topic>FUNCTIONAL ELEMENTS OF SUCH SYSTEMS</topic><topic>MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS ORELEMENTS</topic><topic>PERFORMING OPERATIONS</topic><topic>PHYSICS</topic><topic>REGULATING</topic><topic>ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TOTHE CONTROL OF A PARTICULAR SUB-UNIT</topic><topic>SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES</topic><topic>TRANSPORTING</topic><topic>VEHICLES IN GENERAL</topic><toplevel>online_resources</toplevel><creatorcontrib>Huizinga, Joost</creatorcontrib><creatorcontrib>Lehman, Joel Anthony</creatorcontrib><creatorcontrib>Stanley, Kenneth Owen</creatorcontrib><creatorcontrib>Ecoffet, Adrien Lucas</creatorcontrib><creatorcontrib>Clune, Jeffrey Michael</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Huizinga, Joost</au><au>Lehman, Joel Anthony</au><au>Stanley, Kenneth Owen</au><au>Ecoffet, Adrien Lucas</au><au>Clune, Jeffrey Michael</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Deep reinforcement learning based models for hard-exploration problems</title><date>2023-11-28</date><risdate>2023</risdate><abstract>A self-driving vehicle implements a deep reinforcement learning based model. The self-driving vehicle comprise one or more sensors configured to capture sensor data of an environment of the self-driving vehicle, a control system configured to navigate the self-driving vehicle, and a controller to determine and provide instructions to the control system. The controller implements a deep reinforcement learning based model that inputs the sensor data captured by the sensors to determine actions to perform by the control system. The model includes an archive storing states reachable by an agent in a training environment, each state stored in the archive is associated with a trajectory for reaching the state. The archive is generated by visiting states stored in the archive and performing actions to explore and find new states. New states are stored in the archive with their trajectories.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng
recordid cdi_epo_espacenet_US11829870B2
source esp@cenet
subjects CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE ORDIFFERENT FUNCTION
CONTROL OR REGULATING SYSTEMS IN GENERAL
CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES
CONTROLLING
COUNTING
FUNCTIONAL ELEMENTS OF SUCH SYSTEMS
MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS ORELEMENTS
PERFORMING OPERATIONS
PHYSICS
REGULATING
ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TOTHE CONTROL OF A PARTICULAR SUB-UNIT
SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
TRANSPORTING
VEHICLES IN GENERAL
title Deep reinforcement learning based models for hard-exploration problems
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-31T07%3A55%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=Huizinga,%20Joost&rft.date=2023-11-28&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS11829870B2%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true