Deep reinforcement learning based models for hard-exploration problems
A self-driving vehicle implements a deep reinforcement learning based model. The self-driving vehicle comprise one or more sensors configured to capture sensor data of an environment of the self-driving vehicle, a control system configured to navigate the self-driving vehicle, and a controller to de...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Patent |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Huizinga, Joost Lehman, Joel Anthony Stanley, Kenneth Owen Ecoffet, Adrien Lucas Clune, Jeffrey Michael |
description | A self-driving vehicle implements a deep reinforcement learning based model. The self-driving vehicle comprise one or more sensors configured to capture sensor data of an environment of the self-driving vehicle, a control system configured to navigate the self-driving vehicle, and a controller to determine and provide instructions to the control system. The controller implements a deep reinforcement learning based model that inputs the sensor data captured by the sensors to determine actions to perform by the control system. The model includes an archive storing states reachable by an agent in a training environment, each state stored in the archive is associated with a trajectory for reaching the state. The archive is generated by visiting states stored in the archive and performing actions to explore and find new states. New states are stored in the archive with their trajectories. |
format | Patent |
fullrecord | <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US11829870B2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US11829870B2</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US11829870B23</originalsourceid><addsrcrecordid>eNqNyjsOwjAQRVE3FAjYw7CASCQUhJZPRA_U0SR-CZZsjzV2wfKhYAFUtzh3aboLkEjh4iQ6IiAW8mCNLs40cIalIBY-09fpxWorvJMX5eIkUlIZPEJem8XEPmPz68psu-vjfKuQpEdOPCKi9M97XbfNsT3sTs3-n-cDYAk0Nw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Deep reinforcement learning based models for hard-exploration problems</title><source>esp@cenet</source><creator>Huizinga, Joost ; Lehman, Joel Anthony ; Stanley, Kenneth Owen ; Ecoffet, Adrien Lucas ; Clune, Jeffrey Michael</creator><creatorcontrib>Huizinga, Joost ; Lehman, Joel Anthony ; Stanley, Kenneth Owen ; Ecoffet, Adrien Lucas ; Clune, Jeffrey Michael</creatorcontrib><description>A self-driving vehicle implements a deep reinforcement learning based model. The self-driving vehicle comprise one or more sensors configured to capture sensor data of an environment of the self-driving vehicle, a control system configured to navigate the self-driving vehicle, and a controller to determine and provide instructions to the control system. The controller implements a deep reinforcement learning based model that inputs the sensor data captured by the sensors to determine actions to perform by the control system. The model includes an archive storing states reachable by an agent in a training environment, each state stored in the archive is associated with a trajectory for reaching the state. The archive is generated by visiting states stored in the archive and performing actions to explore and find new states. New states are stored in the archive with their trajectories.</description><language>eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE ORDIFFERENT FUNCTION ; CONTROL OR REGULATING SYSTEMS IN GENERAL ; CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES ; CONTROLLING ; COUNTING ; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS ; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS ORELEMENTS ; PERFORMING OPERATIONS ; PHYSICS ; REGULATING ; ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TOTHE CONTROL OF A PARTICULAR SUB-UNIT ; SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES ; TRANSPORTING ; VEHICLES IN GENERAL</subject><creationdate>2023</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20231128&DB=EPODOC&CC=US&NR=11829870B2$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76290</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20231128&DB=EPODOC&CC=US&NR=11829870B2$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Huizinga, Joost</creatorcontrib><creatorcontrib>Lehman, Joel Anthony</creatorcontrib><creatorcontrib>Stanley, Kenneth Owen</creatorcontrib><creatorcontrib>Ecoffet, Adrien Lucas</creatorcontrib><creatorcontrib>Clune, Jeffrey Michael</creatorcontrib><title>Deep reinforcement learning based models for hard-exploration problems</title><description>A self-driving vehicle implements a deep reinforcement learning based model. The self-driving vehicle comprise one or more sensors configured to capture sensor data of an environment of the self-driving vehicle, a control system configured to navigate the self-driving vehicle, and a controller to determine and provide instructions to the control system. The controller implements a deep reinforcement learning based model that inputs the sensor data captured by the sensors to determine actions to perform by the control system. The model includes an archive storing states reachable by an agent in a training environment, each state stored in the archive is associated with a trajectory for reaching the state. The archive is generated by visiting states stored in the archive and performing actions to explore and find new states. New states are stored in the archive with their trajectories.</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE ORDIFFERENT FUNCTION</subject><subject>CONTROL OR REGULATING SYSTEMS IN GENERAL</subject><subject>CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES</subject><subject>CONTROLLING</subject><subject>COUNTING</subject><subject>FUNCTIONAL ELEMENTS OF SUCH SYSTEMS</subject><subject>MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS ORELEMENTS</subject><subject>PERFORMING OPERATIONS</subject><subject>PHYSICS</subject><subject>REGULATING</subject><subject>ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TOTHE CONTROL OF A PARTICULAR SUB-UNIT</subject><subject>SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES</subject><subject>TRANSPORTING</subject><subject>VEHICLES IN GENERAL</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2023</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNyjsOwjAQRVE3FAjYw7CASCQUhJZPRA_U0SR-CZZsjzV2wfKhYAFUtzh3aboLkEjh4iQ6IiAW8mCNLs40cIalIBY-09fpxWorvJMX5eIkUlIZPEJem8XEPmPz68psu-vjfKuQpEdOPCKi9M97XbfNsT3sTs3-n-cDYAk0Nw</recordid><startdate>20231128</startdate><enddate>20231128</enddate><creator>Huizinga, Joost</creator><creator>Lehman, Joel Anthony</creator><creator>Stanley, Kenneth Owen</creator><creator>Ecoffet, Adrien Lucas</creator><creator>Clune, Jeffrey Michael</creator><scope>EVB</scope></search><sort><creationdate>20231128</creationdate><title>Deep reinforcement learning based models for hard-exploration problems</title><author>Huizinga, Joost ; Lehman, Joel Anthony ; Stanley, Kenneth Owen ; Ecoffet, Adrien Lucas ; Clune, Jeffrey Michael</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US11829870B23</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2023</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE ORDIFFERENT FUNCTION</topic><topic>CONTROL OR REGULATING SYSTEMS IN GENERAL</topic><topic>CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES</topic><topic>CONTROLLING</topic><topic>COUNTING</topic><topic>FUNCTIONAL ELEMENTS OF SUCH SYSTEMS</topic><topic>MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS ORELEMENTS</topic><topic>PERFORMING OPERATIONS</topic><topic>PHYSICS</topic><topic>REGULATING</topic><topic>ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TOTHE CONTROL OF A PARTICULAR SUB-UNIT</topic><topic>SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES</topic><topic>TRANSPORTING</topic><topic>VEHICLES IN GENERAL</topic><toplevel>online_resources</toplevel><creatorcontrib>Huizinga, Joost</creatorcontrib><creatorcontrib>Lehman, Joel Anthony</creatorcontrib><creatorcontrib>Stanley, Kenneth Owen</creatorcontrib><creatorcontrib>Ecoffet, Adrien Lucas</creatorcontrib><creatorcontrib>Clune, Jeffrey Michael</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Huizinga, Joost</au><au>Lehman, Joel Anthony</au><au>Stanley, Kenneth Owen</au><au>Ecoffet, Adrien Lucas</au><au>Clune, Jeffrey Michael</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Deep reinforcement learning based models for hard-exploration problems</title><date>2023-11-28</date><risdate>2023</risdate><abstract>A self-driving vehicle implements a deep reinforcement learning based model. The self-driving vehicle comprise one or more sensors configured to capture sensor data of an environment of the self-driving vehicle, a control system configured to navigate the self-driving vehicle, and a controller to determine and provide instructions to the control system. The controller implements a deep reinforcement learning based model that inputs the sensor data captured by the sensors to determine actions to perform by the control system. The model includes an archive storing states reachable by an agent in a training environment, each state stored in the archive is associated with a trajectory for reaching the state. The archive is generated by visiting states stored in the archive and performing actions to explore and find new states. New states are stored in the archive with their trajectories.</abstract><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | eng |
recordid | cdi_epo_espacenet_US11829870B2 |
source | esp@cenet |
subjects | CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING CONJOINT CONTROL OF VEHICLE SUB-UNITS OF DIFFERENT TYPE ORDIFFERENT FUNCTION CONTROL OR REGULATING SYSTEMS IN GENERAL CONTROL SYSTEMS SPECIALLY ADAPTED FOR HYBRID VEHICLES CONTROLLING COUNTING FUNCTIONAL ELEMENTS OF SUCH SYSTEMS MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS ORELEMENTS PERFORMING OPERATIONS PHYSICS REGULATING ROAD VEHICLE DRIVE CONTROL SYSTEMS FOR PURPOSES NOT RELATED TOTHE CONTROL OF A PARTICULAR SUB-UNIT SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES TRANSPORTING VEHICLES IN GENERAL |
title | Deep reinforcement learning based models for hard-exploration problems |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-31T07%3A55%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=Huizinga,%20Joost&rft.date=2023-11-28&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS11829870B2%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |