Multi-target path planning method based on improved SAC algorithm
The invention belongs to the field of reinforcement learning, and particularly relates to a multi-target path planning method based on an improved SAC algorithm. According to the method, sufficient path experience of the robot reaching each shelf position is planned to be collected, and supervised l...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | HAN HUIYAN PANG MIN ZHENG XINYI SUN FUSHENG |
description | The invention belongs to the field of reinforcement learning, and particularly relates to a multi-target path planning method based on an improved SAC algorithm. According to the method, sufficient path experience of the robot reaching each shelf position is planned to be collected, and supervised learning assistance is carried out by reading offline expert experience before actual distribution, so that the distribution efficiency is improved; on the basis of preferential experience playback of the SumTree, the adopted rate of effective path sample experience is increased; based on a calculation reward mechanism of multi-step TD-error, subsequent multi-step rewards are comprehensively considered. According to the method, robot navigation and the SAC algorithm in reinforcement learning are combined, the limitation of a traditional path planning algorithm on a model is eliminated, the learning speed of the robot and the utilization efficiency of experience samples are improved, and the problems that the optimal |
format | Patent |
fullrecord | <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN116858248A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN116858248A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN116858248A3</originalsourceid><addsrcrecordid>eNrjZHD0Lc0pydQtSSxKTy1RKEgsyVAoyEnMy8vMS1fITS3JyE9RSEosTk1RyM9TyMwtKMovA7KDHZ0VEnPS84sySzJyeRhY0xJzilN5oTQ3g6Kba4izh25qQX58anFBYnJqXmpJvLOfoaGZhamFkYmFozExagDhgzFi</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Multi-target path planning method based on improved SAC algorithm</title><source>esp@cenet</source><creator>HAN HUIYAN ; PANG MIN ; ZHENG XINYI ; SUN FUSHENG</creator><creatorcontrib>HAN HUIYAN ; PANG MIN ; ZHENG XINYI ; SUN FUSHENG</creatorcontrib><description>The invention belongs to the field of reinforcement learning, and particularly relates to a multi-target path planning method based on an improved SAC algorithm. According to the method, sufficient path experience of the robot reaching each shelf position is planned to be collected, and supervised learning assistance is carried out by reading offline expert experience before actual distribution, so that the distribution efficiency is improved; on the basis of preferential experience playback of the SumTree, the adopted rate of effective path sample experience is increased; based on a calculation reward mechanism of multi-step TD-error, subsequent multi-step rewards are comprehensively considered. According to the method, robot navigation and the SAC algorithm in reinforcement learning are combined, the limitation of a traditional path planning algorithm on a model is eliminated, the learning speed of the robot and the utilization efficiency of experience samples are improved, and the problems that the optimal</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; GYROSCOPIC INSTRUMENTS ; MEASURING ; MEASURING DISTANCES, LEVELS OR BEARINGS ; NAVIGATION ; PHOTOGRAMMETRY OR VIDEOGRAMMETRY ; PHYSICS ; SURVEYING ; TESTING</subject><creationdate>2023</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20231010&DB=EPODOC&CC=CN&NR=116858248A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20231010&DB=EPODOC&CC=CN&NR=116858248A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>HAN HUIYAN</creatorcontrib><creatorcontrib>PANG MIN</creatorcontrib><creatorcontrib>ZHENG XINYI</creatorcontrib><creatorcontrib>SUN FUSHENG</creatorcontrib><title>Multi-target path planning method based on improved SAC algorithm</title><description>The invention belongs to the field of reinforcement learning, and particularly relates to a multi-target path planning method based on an improved SAC algorithm. According to the method, sufficient path experience of the robot reaching each shelf position is planned to be collected, and supervised learning assistance is carried out by reading offline expert experience before actual distribution, so that the distribution efficiency is improved; on the basis of preferential experience playback of the SumTree, the adopted rate of effective path sample experience is increased; based on a calculation reward mechanism of multi-step TD-error, subsequent multi-step rewards are comprehensively considered. According to the method, robot navigation and the SAC algorithm in reinforcement learning are combined, the limitation of a traditional path planning algorithm on a model is eliminated, the learning speed of the robot and the utilization efficiency of experience samples are improved, and the problems that the optimal</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>GYROSCOPIC INSTRUMENTS</subject><subject>MEASURING</subject><subject>MEASURING DISTANCES, LEVELS OR BEARINGS</subject><subject>NAVIGATION</subject><subject>PHOTOGRAMMETRY OR VIDEOGRAMMETRY</subject><subject>PHYSICS</subject><subject>SURVEYING</subject><subject>TESTING</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2023</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZHD0Lc0pydQtSSxKTy1RKEgsyVAoyEnMy8vMS1fITS3JyE9RSEosTk1RyM9TyMwtKMovA7KDHZ0VEnPS84sySzJyeRhY0xJzilN5oTQ3g6Kba4izh25qQX58anFBYnJqXmpJvLOfoaGZhamFkYmFozExagDhgzFi</recordid><startdate>20231010</startdate><enddate>20231010</enddate><creator>HAN HUIYAN</creator><creator>PANG MIN</creator><creator>ZHENG XINYI</creator><creator>SUN FUSHENG</creator><scope>EVB</scope></search><sort><creationdate>20231010</creationdate><title>Multi-target path planning method based on improved SAC algorithm</title><author>HAN HUIYAN ; PANG MIN ; ZHENG XINYI ; SUN FUSHENG</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN116858248A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2023</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>GYROSCOPIC INSTRUMENTS</topic><topic>MEASURING</topic><topic>MEASURING DISTANCES, LEVELS OR BEARINGS</topic><topic>NAVIGATION</topic><topic>PHOTOGRAMMETRY OR VIDEOGRAMMETRY</topic><topic>PHYSICS</topic><topic>SURVEYING</topic><topic>TESTING</topic><toplevel>online_resources</toplevel><creatorcontrib>HAN HUIYAN</creatorcontrib><creatorcontrib>PANG MIN</creatorcontrib><creatorcontrib>ZHENG XINYI</creatorcontrib><creatorcontrib>SUN FUSHENG</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>HAN HUIYAN</au><au>PANG MIN</au><au>ZHENG XINYI</au><au>SUN FUSHENG</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Multi-target path planning method based on improved SAC algorithm</title><date>2023-10-10</date><risdate>2023</risdate><abstract>The invention belongs to the field of reinforcement learning, and particularly relates to a multi-target path planning method based on an improved SAC algorithm. According to the method, sufficient path experience of the robot reaching each shelf position is planned to be collected, and supervised learning assistance is carried out by reading offline expert experience before actual distribution, so that the distribution efficiency is improved; on the basis of preferential experience playback of the SumTree, the adopted rate of effective path sample experience is increased; based on a calculation reward mechanism of multi-step TD-error, subsequent multi-step rewards are comprehensively considered. According to the method, robot navigation and the SAC algorithm in reinforcement learning are combined, the limitation of a traditional path planning algorithm on a model is eliminated, the learning speed of the robot and the utilization efficiency of experience samples are improved, and the problems that the optimal</abstract><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | chi ; eng |
recordid | cdi_epo_espacenet_CN116858248A |
source | esp@cenet |
subjects | CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING GYROSCOPIC INSTRUMENTS MEASURING MEASURING DISTANCES, LEVELS OR BEARINGS NAVIGATION PHOTOGRAMMETRY OR VIDEOGRAMMETRY PHYSICS SURVEYING TESTING |
title | Multi-target path planning method based on improved SAC algorithm |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T23%3A28%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=HAN%20HUIYAN&rft.date=2023-10-10&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN116858248A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |